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Abstract 


Quantum Bayesian networks with application to games displaying Parrondo’s paradox 

by 

Michael Pejic 

Doctor of Philosophy in Mathematics 
University of California, Berkeley 
Professor F. Alberto Griinbaum, Chair 


Bayesian networks and their accompanying graphical models are widely used for prediction 
and analysis across many disciplines. We will reformulate these in terms of linear maps. This 
reformulation will suggest a natural extension, which we will show is equivalent to standard 
textbook quantum mechanics. Therefore, this extension will be termed quantum. However, 
the term quantum should not be taken to imply this extension is necessarily only of utility in 
situations traditionally thought of as in the domain of quantum mechanics. In principle, it 
may be employed in any modeling situation, say forecasting the weather or the stock market- 
it is up to experiment to determine if this extension is useful in practice. Even restricting 
to the domain of quantum mechanics, with this new formulation the advantages of Bayesian 
networks can be maintained for models incorporating quantum and mixed classical-quantum 
behavior. The use of these will be illustrated by various basic examples. 

Parrondo’s paradox refers to the situation where two, multi-round games with a hxed 
winning criteria, both with probability greater than one-half for one player to win, are 
combined. Using a possibly biased coin to determine the rule to employ for each round, 
paradoxically, the previously losing player now wins the combined game with probability 
greater than one-half. Using the extended Bayesian networks, we will formulate and analyze 
classical observed, classical hidden, and quantum versions of a game that displays this para¬ 
dox, Ending bounds for the discrepancy from naive expectations for the occurrence of the 
paradox. A quantum paradox inspired by Parrondo’s paradox will also be analyzed. We will 
prove a bound for the discrepancy from naive expectations for this paradox as well. Games 
involving quantum walks that achieve this bound will be presented. 
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Chapter 1 
Introduction 


Outline of the work 

Bayesian networks and graphical models are useful for classical systems because they are 
much more intuitive than a list of conditional dependencies. It is also sometimes useful to 
introduce additional, hypothesized nodes to break complicated dependencies into simpler, 
potentially universal modules. Usually these are treated just as observable nodes which 
are always hidden; however, this imposes constraints that are metaphysical in origin and 
raises difficulties of interpretation. We will give an alternate approach using linear maps 
on measures, with additional constructions to those generally utilized in graphical models, 
that resolves those issues. While, in certain situations, this introduces additional maps, not 
previously available, these new maps are not only in and of themselves of limited interest, 
but also introduce undesired complications. However, what is extremely fruitful is simply the 
conceptual leap. Thinking in terms of linear maps on spaces of measures immediately raises 
the question of looking at linear maps on other spaces. This approach leads to a natural ex¬ 
tension, which we will prove is equivalent to standard, textbook quantum mechanics (which 
is infamous for its apparently unmotivated and incomprehensible formulation). Therefore, 
this extension will be termed quantum. To avoid being swept away by a flood of details, 
propositions of a more general nature, together with their proofs, needed to show the sensi¬ 
bility and consistency of the extension and its equivalence to quantum mechanics are placed 
in appendices. 

However, the term quantum should not be taken to imply this extension is necessarily 
only of utility in situations traditionally thought of as in the domain of quantum mechanics. 
In principle, it may be employed in any modeling situation, say forecasting weather or 
stock prices-it is up to experiment to determine if this extension is useful in practice. In 
particular, there is no reason for H to necessarily enter into these models if they are outside 
the realm of physics. Even restricting to the traditional domain of quantum mechanics, with 
this new formulation the advantages of Bayesian networks can be maintained for models 
incorporating quantum and mixed classical-quantum behavior. The use of these will be 
illustrated by various examples. In particular, we will show that some of the supposed 
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hallmarks of quantum mechanics, no-cloning and teleportation, apply for classical hidden 
systems as well. 

In the second part, we will utilize these extended Bayesian networks in the study of 
various games displaying Parrondo’s paradox-the phenomenon of two games each winning 
for one player with probability greater than one-half, yet their convex combination (in a 
sense to be specihed) paradoxically winning for the previously losing player with probability 
greater than one-half. We will prove bounds for the discrepancy from naive expectations for 
classical versions of a game; those bounds will then be shown to be broken by a quantum 
analogue of the game. 


31 


A quantum paradox inspired by Parrondo’s paradox will also be analyzed. We will prove 
a bound for the discrepancy from naive expectations for this paradox. Games involving 
quantum walks that achieve this bound will be presented. 


Philosophical interlude—rejection of metaphysics 

For a man will attain unto nothing more perfect than to be found to be most 
learned in the ignorance which is distinctly his. The more he knows that he is 
unknowing, the more learned he will be.-Nicholas of Cusa 


33 


Natural scientists may adopt whatever attitude they please; they are still under 
the domination of philosophy.-F. Engels 


24 


Physicists generally pretend to ignore philosophy, but since physics (unlike mathematics) 
deals with knowledge of external reality, it is not possible to avoid epistemology. A commonly 
expressed (although often unacknowledged) philosophy is that of naive realism: While other 
sciences have to deal with human-created concepts, physics is a “god’s-eye view” of reality as 
it really is. This is presented pictorially in Figure 1.1. However, there are several problems 
with this: (i) it does not agree with what physicists actually do in practice 12 [^; (m) 


it needlessly encumbers physics with many questions that are metaphysical in nature-the 
direction of time, the reason for existence of nature, the collapse of the wavefunction, and 
so on; and, most importantly for our work in this dissertation, [in) it makes it seem as if 
quantum is a kind of physics with no connection to modeling. 

The philosophy we employ in this work is one with a long-standing pedigree 


123 : We 


know nothing about reality as it really is; hence, external truths are necessarily tentative 
and model-dependent. This is presented pictorially in Figure 1.2. This credo was widely 
disseminated by Pragmatists in the late nineteenth century who were influenced by the 
methodology of science 


122 60 


34 35 


philosophers of science in the century since 46 36 


and it has continued to be advocated by 
3^ [nl |45l 1^ g. The beneht 


of this philosophy is that not only does it resolve the hrst two questions raised above, but, 
most pertinent to this present work, it shows quantum is actually a sort of model. Quantum 
mechanics is then the overlap of physics with this class of models. 
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Figure 1.1: Traditional conception of the relation of physics to Bayesian networks. 



Figure 1.2: Proposed relation of physics to Bayesian networks. 




CHAPTER 1. INTRODUCTION 


4 


There is no reason to believe reality is doing calculations at all similar to the ones we 
employ in our hypothesized modeller even that it is doing calculations at all. In particular 
there is the issue of contextuality-we want to employ potentially universal modules in our 
models since only they have predictive ability in novel situations, but perhaps reality is 
fundamentally contextual 

Constraints imposed on our models 

In keeping with the expressed philosophy, no metaphysical constraints arising from claims 
to know what is “really going on” will be placed upon the mathematical operations and 
constructs that can be employed in the models. Rather, there are only three rules that will 
be enforced. Firstly, the quantities calculated by the mathematical models must be inter¬ 
pretable as probabilities; in particular, they must be positiv^ Secondly, the mathematical 
models must be composed of linear maps (which we will show is the weakening of a principle 
already in wide use, if not always acknowledged). Thirdly, the mathematical models must 
be composed of potentially universal modules (in a manner that we will precisely dehne). 
Of course, the particular model employed in a particular situation may fail to be universal 
when actually employed in a different context; the point is that this failure should be as a 
result of experiment and not be preordained as result of our choice of mathematics employed 
in modeling. 

These constraints are extremely restrictive. For maps to be linear, they must clearly 
live on linear spaces. Furthermore, the constraints also impose strong restrictions on the 
linear spaces these maps live on. Thus far, we are only aware of three classes of linear 
spaces that meet the imposed restrictions: (i) certain subspaces of measures; (ii) density 
matrices on complex Hilbert spaces ; and (Hi) tensor products of these. The former gives 
what is traditionally thought of as classical behavior and we will prove the latter two give 
behavior that has traditionally been taken the domain of quantum mechanics. However, 
since we do not start from quantum mechanics, but instead only from the above principles, 
models involving maps on density matrices may be found to be of utility in situations not 
traditionally thought of as related to quantum mechanics. 


^ Human mathematics is an abstraction from our experiences as embodied beings 45 -perhaps reality 
uses a form of mathematics that cannot be conceived of in this manner. 

^Gravity-a long-range force that cannot be screened-is a clue that this may indeed be the case. Isolated 
subsystems are only possible since gravity is so weak it can be generally ignored. 

^To aid readability, positive is used instead of nonnegative throughout. Wherever strict positivity is 
required, the word strict will be added. 
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Part I 

Quantum Bayesian networks 
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Chapter 2 

Bayesian networks-graphical models 


2.1 Graphical models as the form in which 
information is presented 


There are many ways to present joint probability in terms of other quantities and mathe¬ 
matical constructs. Graphical models 37 are a useful way to sort through what otherwise 
can seem hopelessly complicated in the usual notation. For instance, suppose we are given 
a probability space (r2,T,7r), with Q a set, S a cx-algebraof subsets of Q, and vr : T —)■ M 
a probability measure. Furthermore suppose we have two generalized random variables, 
df : G —)■ Tf and F : G —)■ 3^ for sets X and 3^. Then, for any A G X{S) and B eY (T), the 
joint probability Prob(X G A and Y G i?) is tt (X“^(y4) fl Y~^{B)). The graphical model 
corresponding to presenting the joint probability in this manner-namely by giving (G, tt, 
df, X, 3^, F)-is 



Y 


( 2 . 1 ) 


where the double arrows stand for deterministic causation. 

The resulting joint probability determines a probability space (X x y, p) with the 
probability measure p given on rectangular subsets by p{A x B) = 'k{X~^{A) U Y~^{B)), 
which can be readily extended to a probability measure for the cx-algebra X = [X x y){£). 
The graphical model corresponding to presenting the joint probability in this manner-namely 

a-algebra is a collection of subsets of a set X, including both 0 and X, that is closed under relative 
complementation and countable unions. 
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by giving {X x 3^, X, p)-is 




( 2 . 2 ) 


For each B G Y{£), the probability for X determines a probability space {X,Q,pb) 
with cr-algebra Q = X{£) and with probability measure ps given by Pb{,A) = p{A x B) = 
7t{X~^{A) U F“^(i?)) = Prob {X E A and Y G B). The marginal probability for X is then 
given by p = py. Let ^ x 3^ signify the a-algebra of all rectangular sets of the form Axy 
for A G Since p is a hnite measure (hence, a-hn it^ and Pb is absolutely continuou^ 
with respect to p, by the Radon-Nikodym theorem |72j , the measures {pb} determine the 
conditional probability (shown in two common notations) t{B\Q x 3^) (a;) or t{B\x) as the 

Radon-Nikodym derivative -^(x) 


dp 


dpB 


The Radon-Nikodym derivative —— (x) is actually an equivalence class of functions 

dp 


Let 


L^{X] p) denote the space of equivalence classes of /x-measurable function^ that differ on p- 
null subsets. As usual, for p G [0, cxd), let L^(A’; p) be the space of “functions” / in p) 

such that \ f\^ dp is hnite (the value of the integral is independent of the representative 
function in the equivalence class). For p G [l,oo), these are Banach spaces for the usual 
norm 



\f\Pdp 


(2.3) 


Furthermore, let L°°{X]p) be the space of “functions” / in LP{X]p) that are essentially 
boundecj^in magnitude. This is a Banach space with the norm ||/|| being the least essential 

bound on 1/1. Then = t{B\x) is actually a “function” in L^iX]p) r\L°°iX]p). This 

dp 

space is clearly isometrically isomorphic to L^{X x 3^; p\gxy) F L°°{X x 3^; p\gxy)i which is 
emphasized by the notation t{B\Q x y){x). 


measure /r on set X is a-finite if there are a countable collection of /r-measurable subsets {Bj} such 
that Bj = X with each iJ.{Bj) finite. 

^For measure spaces {X,£,fnu) and (A,£,nM), ly is absolutely continuous with respect to yi if p{A) = 0 
implies v{A) = 0 for all subsets A E S. 

^By abuse of nomenclature it is by convention termed a “function”-we will maintain the quotes for this 
latter case since we need to maintain the distinction. 

®For a given measure space (X, £, y.), let £o be the cr-algebra given by the completion of £ with respect to 
y (see [73| ). The extension of y to £o will still be denoted y. Then we will say ^ is a E-measure if is a sub- 
cr-algebra of £q. An element of £o will be termed y-measurable. A function / : A —> KU {oo, —oo, undefined} 
is y-measurable if it takes the later three values only on a /i-null subset and is elsewhere measurable with 
respect to £o in the usual sense. 

® Bounded on all but a /r-null subset 
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Let H be the a-algebra Y(S). For any disjoint, countable collection {Bj} C "H, 

B, = P(- X IJ Bj) = ^ p(. X Bj) = ^ iXB, (2.4) 


with convergence in norm. Since 


j 

d/iB 


dfj, 




— \\Tb\\, 




■^ = ^ (Bj 


(2.5) 


with convergence in L^{A-, fi)-noTm. Hence, the conditional probability r(-|-) is a L^(A;/j,)- 
valued vector measur^on 3^. 

Then the joint probability Prob(X G A and Y E B) is given by 


t{B\x) dfi{x) 


( 2 . 6 ) 


' x&A 


The directed graphical model corresponding to presenting the joint probability in this manner- 
namely by giving {X, Q, y, "H,/!, T{-\-))-is 


X, 


B- 


■© 


Y 


(2.7) 


Let u be the marginal probability for F, z/(i?) = p{X x B). Then r(-|-) is absolutely 
continuous with respect to u in the sense that t{B\-) is the zero “function” for every B such 
that I'iB) = 0. However, unless we are in the common case where the cx-algebra on X is 
generated by a countable collection of atom^ L^{X]p) does not have the Radon-Nikodym 
propertjj^ |107] (the example given above demonstrates this); hence, there is in general no 
/ G L^{y-, tJ~IA{X] fi)) (which by Fubini’s theorem is the same as L^{X x y-, p x u) for 
the product measur^^ p x v) such that the joint probability Prob(X G A and Y E B) is 
given by x i^){x,y). 

Lastly, dehning C('l') symmetrically to r(-|-), the directed graphical model corresponding 
to presenting the joint probability 


Prob(X G H and F G R) = 


(;(A\y)dAy) 


( 2 . 8 ) 


ly&B 


vector measure is a countably-additive set function with values in a Banach space where the conver¬ 
gence for the countably-additivity is in norm. 

®A set in a set algebra is an atom if it is indivisible in the set algebra. 

®A Banach space B has the Radon-Nikodym property if, for any B-valued vector measure on a set X 
which is absolutely continuous with respect to some tr-finite measure /r on A, there is a Bochner integrable, 
B-valued “function”, such that iy{A) = X 4 ^ for any j/-measurable A. 

Unfortunately, by convention the tensor product of measures is called the product measure and written 
using X instead of the more appropriate 0 (however, see 111 for a use of the latter notation). 
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by giving the marginal probability z/ and the conditional probability C('l') is 

As an example, consider calculating the conditional probability 

Proh(Y e B and X e A) 

Prob {Y e B\X eA) = --tt- - 

^ ^ Prob(X e A) 

(for Prob(X G A) 7 ^ 0-otherwise the joint probability does not determine the conditional 
probability) using the information presented in the manner corresponding to each of the four 
graphical models (the filled circle indicates which node is being conditioned on): 


(2.9) 


( 2 . 10 ) 


x e 



Y 


7r{X-\A)nY-\B)) 

vr(^-H^)) 


( 2 . 11 ) 


X e A 


-e 


Y 


X eA 




Y 


X eA 


-e 


Y 


p{AxB) 

p{Axy) 


/i(A) 


/^ 6 bC(A|i/) dujy) 
Iy(.yC{A\y) dv{y) 


( 2 . 12 ) 


(2.13) 


(2.14) 


For instance, Bayes’ theorem is simply the calculation corresponding to the presentation of 
information by the last graphical model. 


A similar situation holds for any hnite number of random variables 69 , with a graphical 


model with deterministic causation emanating from a fundamental, hidden probability space, 
a graphical model with a clique of all the nodes for the random variables, and various directed 
models with probabilistic causation. For the common case where the various a-algebras are 
generated by hnitely many atoms, the various measures become vectors, the conditional 
probabilities become stochastic matriceA^ or tensors, and the integrations become sums. 


’^A matrix is stochastic if all entries are either positive or zero and all column sums are one. 
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2.2 Transition probability functions 


Depending on how the information for the calculation of the joint probability is presented, 


we may imagine different ways of varying it. For (2.1), it is most natural to imagine indepen¬ 


dently varying the maps X and Y. For (2.2), there is nothing to independently vary other 


than the joint probability itself. For (2.7), we would like to imagine varying the marginal 
probability fi and the conditional probability t(-|-) independently. This is a problem because 
the space r(-|-) lives in-the L^(A’;p)-valued vector measures-depends on /r. This problem 
will exist even in the commonly occurring case where the a-algebra Q is generated by a 
countable collection of atoms if /i is zero on some atom (other than the empty set), since 
then the conditional probability when conditioning on that atom is not well-dehned. 

One solution to this problem, following 
notion: 


69 , is provided by introducing the following 


Definition 2.2.1 For cr-algebras Q on A and TL on y, a function r(-|-):'HxA’—j-Mis 
a transition probability function if: (i) for each x & A, 'r(-|x) is a probability measure on 
y with event cr-algebra PL; and (ii) for each B & PL, '^(-^1') is a bounded, ^-measurable 
function on A. 


By taking 'r(-|-) to be a transition probability function rather than a conditional probability, 
it is now possible to vary fi and r(-|-) independently. 

Following (Ml, one interpretation of the transition probability function is a function on A 


with values in the probability measures on 3^; naively, one is tempted to imagine t{-\x) is the 
probability given X = x, although this is meaningless unless the marginal probability pi has 
a strictly positive mass atom at x. Note the function is not generally Bochner integrabl^^ 
unless the a-algebra "H on 3^ is generated by a countable collection of atoms. For example, 
ii A = y and the random variables X and Y are the same, then t{B\x) = 1b(x) = 6x{B), 
where the second interpretation as t{-\x) = 6x is not Bochner integrable unless the a-algebra 
on A is generated by a countable collection of atoms. 

Also note that the interpretation as a function-valued measure is potentially problematic 
since the result is not generally a vector measure because convergence for countable additivity 
is generally pointwise rather than in norm (using the supremum norm which is natural for 
functions); for instance, consider the preceding example. By the Dominated Convergence 
theorem, however, this pointwise convergence is adequate for integration. 


75 


We now raise the question of whether it is always possible to upgrade a conditional 
probability to a transition probability function. Under rather broad conditions, this is indeed 
possible. Firstly, to make things precise we have the following dehnition: 


^^For any Banach space B, a B-valued function on a set X with a measure y is Bochner integrable if there 
is a sequence of simple functions (functions taking only finitely many values with each value achieved on a 
set with finite /x-measure) converging to it, both pointwise in B-norm almost everywhere and in D{X; y; B)- 


norm. 
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Definition 2.2.2 A transition probability function is a lift of a conditional probability 
if G i^{B\-) for every subset B in the cr-algebra. 

Then, by A3.7, for any conditional probability there is a lift to a transition probability 
function (not unique in general) given either of the following sufficient conditions (which are 
generally met in practice): (i) the a-algebra Tf on A’ is generated by countably many atoms; 
(u) y is an uncountable, complete, separable, metric space with Ti the Borel a-algebra; or 
[iii) the a-algebra ^ on A is generated by countably many atoms. See A3.10 for additional 
sufficient conditions. 


Removing metaphysical constraints 


Imagining one can vary /i and keep the transition probability function r(-|-) fixed, there is an 
intuitive interpretation of 'r(-|-) as an idealized conditional probabilitjj^ with t{B\x) being 
the probability to observe B given the event {x}, even if the latter has probability zero or 
is not even in the event cr-algebra (although in this latter case it is constant within any 
atom). This leads to a metaphysical notion of the actual existence of a variable taking an 
actual value with probability reflecting our ignorance of its value. This may be of value for 
nodes in a graphical model which are observable; however, it is common to add hypothesized, 
hidden nodes to a directed graphical model in order to (hopefully) break it up into smaller. 


more manageable pieces 38 . For these, there is no justification to necessarily limit oneself 
to transition probability functions. Furthermore, the probabilities and conditional proba¬ 
bilities involving the hidden nodes lack any meaning in either the Bayesian or frequentist 
interpretations-the word probability then only means positive and norm-one. 

For a fixed choice of t(-|-), there is a convex linear map L from probability measures on 
X to probability measures on y given by 


{Lii){B) = / t{B\x) dp,{x) 


(2.15) 


'x&X 


The map L extends to a linear map on more general measures and signed measures. Hence, 
another solution (among many others) to the above posed problem is therefore to take the 
independently varied objects to be the marginal probability /i and the linear map L. Not 
all linear maps that take probability measures to probability measures are necessarily in¬ 


duced by some transition probability function as in (2.15) (however, they are all induced by 
some pseudo-transition ‘function”-see B2.7, B2.8, and B2.9), so this generally introduces 
additional maps. These additional maps, involving operations such as Lebesgue decompo¬ 
sition 76 , are not in and of themselves of great interest; in fact the raise rather undesired 
complications (see ((3.2). Furthermore, in the common case where the a-algebras are gen¬ 


erated by countably many atoms, any linear map is induced by some transition probability 
function, so there are no additional maps. However, what is fruitful is the conceptual shift; 
as will be explored in the following chapter, once we are thinking in terms of linear maps. 


noted in 69 . 
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we are immediately drawn to consider the question of looking at linear maps between spaces 
other than spaces of measures. 

2.3 Graphical models as constraints 

In addition to showing the form in which information is presented, graphical models can 
also show constraints on the information in a far simpler form than the usual notation. For 
instance, consider the Markov chain with three random variables X, Y, Z. For the graphical 
model 



with the joint probability Prob(X G A and Y E B and Z E C) given by tt{X~^{A) fl 
Y~^{B) n Z~^{C)), it is necessary to explicitly add the constraint that for every C E Z(S), 
the conditional probability r(C|a;,?/) = for nc{AxB) = {X~^{A)r[Y~^{B)nZ~^{C)) 

and jj, = nzi Li independent of x (in the almost-everywhere, probabilistic sense). If, as above, 
we imagine varying the maps X and F, it is not at all clear how to do this while maintaining 
the constraint. 

Similarly, for the graphical model 



(2.17) 


with the joint probability Prob(X G A and Y E B and Z E C) given by p{A x i? x C), it 
is necessary to explicitly add the constraint that for every C eT = Z[E-\ the conditional 
probability T(C\x,y) = for x B) = p{A x B x C) and p = pz, is independent of 
X (in the almost-every where, probabilistic sense). If, as above, we imagine varying the joint 
probability p, it is not at all clear how to do this while maintaining the constraint. 

However, consider the directed graphical model; 


■^©- 


(2.18) 




CHAPTER 2. BAYESIAN NETWORKS-GRAPHICAL MODELS 


13 


which corresponds to presenting the information to calculate the joint probability as 0, 
and t(-|-) where 

Prob(X G A,Y G -B, and Z G C) = f T{C\y) dn{x,y) = f T{C\y) d^{y) (2.19) 

J{x,y)^AxB Jy^B 

for the marginal probabilities y on A x y and ^ on 3^ given by 

y{A xB)= [ r]{B\x) d(t){x),i{B) = y{X x B) = [ ri{B\x) d(t){x) (2.20) 

J xGA J x^X 

The restriction is that the conditional probability t(-|-) is independent of x (in the almost- 
everywhere, probabilistic sense). It is not necessary to give this constraint explicitly since it 
is indicated by the graphical model through the lack of an arrow directly from X to Z. If, 
as above, we take and t(-|-) as transition probability functions instead of conditional 
probabilities, it is now easy to see how to vary 0, ?7(-|')) '^('l') while maintaining the 

constraint-namely, by only allowing t(-|-) that are independent of x. 

Note if the wrong directed model is chosen, the constraint can be masked. For instance, 
for the graphical model 



( 2 . 21 ) 


which corresponds to presenting the information to calculate the joint probability as u, 6'(-|-), 
and C('l') where 


Prob(X G A, y G S, and Z E C) = / ({A\y, z) dn^y, z) (2.22) 

J{y,z}&BxC 

for the marginal probability k on 3^ x ^ given by 

k{BxC)=[ e{B\z)du{z) (2.23) 

J z&Cx 

Once again, it is necessary to explicitly add the constraint that for every C E Z, the condi¬ 
tional probability r(C'|a;, y) = ^, for ycA x B) = J(^y^^)(,Bxc and y = yz, 

is independent of x (in the almost-everywhere, probabilistic sense). 

This can be readily generalized to more complicated graphical models-any graph that is 
not simply a cliqu^^of all the nodes implies constraints on the allowed joint probabilities. 
In this manner the various dependencies are displayed in a far more intuitive manner than 
through a long list of opaque constraints. Of course, it is always possible to impose additional 
constraints explicitly. 


clique is a group of nodes that are all connected to one another. 
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2.4 Directed graphical models as tensor networks 

Tensor networks are a commonly employed, diagrammatic device for contracting tensors and 
vectors. For the common case where the various a-algebras are generated by hnitely many 
atoms, a directed graphical model together with all its information corresponds to a tensor 
network or, if conditioning is present, the ratio of tensor networks. If the conditioning is only 
on nodes without parents, the denominator is necessarily one, so these can also be considered 
tensor networks. Each node in the graphical model with either no children or only one child 
becomes one node in the tensor network. For nodes in the graphical model with multiple 
children, it is best to replace them with two nodes, one of which takes in all the inputs and 
has a single connection to the other, which is a copying or diagonal node that is zero unless 
all its connections are the same, when it has the value one, which branches out to all the 
outputs. 

For example, consider the graphical model; 



with corresponding information Prob(X = x), Prob(E = y), Prob(E = v\X = x,Y = y), 
Prob(Z = z\V = v), and Prob(IF = w\V = v), so the conditional probability Prob(X = 
x,Y = y, Z = z\W = w) 

(Prob(X = x) Prob(F = y) Prob(E = v\X = x,Y = y) 

Prob(Z = zjV = v) Prob(IF = w\V = v)) ^ 

(Prob(X = x') Prob(F = y') Prob(E = = x', E = y') 

Prob(IF = w\V = P)) 

The corresponding tensor network for the numerator is: 



where A^-y^ = Prob(X = a;)Prob(F = y) Prob(E = v\X = x,Y = y), which is equal to 
Prob(E = v,X = x,Y = y), the Copy tensor is zero unless all its subscript are equal, in 
which case it has value one, B^w = Prob(IF = w\V = v) and C^z = Prob(Z = z\V = v). 
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The value for the tensor network is then 


^ ^ ■^XyV^^HYvv'v"^v'wCv"z ^ ^ AxynBy^C^ 

v,v\v" v^V 


(2.27) 


which equals the numerator. For the denominator, the tensor network in this case is simply 


D, 


-w 


(2.28) 


where 


Dy,= Prob(X = x) Prob(F = y) Prob(l/ = v\X = x,Y = y)PToh{W = w\V = v) 

x&X ,y&y ,v&V 

(2.29) 

which is equal to Prob(hF = w). 

As this example illustrates, the advantages of the Bayesian network over the tensor net¬ 
work are that: (i) it is possible to show which nodes are being observed, marginalized, or 
conditioned on; and (ii) the nodes in the Bayesian network have a more intuitive interpre¬ 
tation. On the other hand, the tensor network does highlight the importance of copying for 
there to be multiple child nodes, which will be important later for incorporating quantum 
nodes (see 13.2 


13.4, and 14.1). 


2.5 The Copy map and restriction maps 

Going along with the linear maps on measures induced by transition probability function 
(|2T5|), we have the following additional useful linear maps for the evaluation of the 


tsee 

joint probability for a directed graphical model. For a set A with cr-algebra S, there is 
a Copy map from T-measures on A to J^-measures on A x A, where IF is the a-algebra 
generated by the rectangular sets £ x £. It is given by, for any set A G -F and F-measure /i, 
Copy(/i)(A) = iJi{{x G A\{x,x) G A}). It is induced by the transition probability function 
r(-|-) given by 

1 if (x,x) G A 
0 otherwise 


t{A\x) = 


(2.30) 


This can clearly be generalized for creating any hnite number of copies. 

For each A G T, there is a restriction map, which is an idempotent, sending T-measures 
on A to T-measureA® on A, /i —)■ 1^/i = /r(A fl ■). It is induced by the transition probability 


function r(-|-) given by t{A\x) = 1 a{,x). 

adopt the convention that a function (or “function”) before a measure, fy, is the signed measure 

fyiA) = lAfdy- 
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2.6 Determinism 

An additional restriction that can be placed on the directed graphical models is one of 
determinism, which we will interpret to mean that each node with a parent is associated to 
a transition probability function (rather than merely a conditional probability) and that all 
transition probability functions take values only zero or one. In that way, all uncertainty is 
due to the input probability measures coming from parentless nodes, which can be interpreted 
as arising from our uncertainty on initial values which are imagined to actually have specihc, 
unknown values. For rather broad conditions (specihed in the following), it is always possible 
to replace a directed graphical model with one that is deterministic and gives identical 
results for the joint probability. For each transition probability function, we add an auxiliary 
measure space, namely [0,1] with the standard topology, the Borel a-algebra, and Lebesgue 
measure as its marginal probability. We then constrict a deterministic transition probability 
function involving this additional space such that when the auxiliary space is marginalized, 
we recover the original transition probability function. Graphically, this means making the 
replacement for each node with a parent 




(2.31) 

where the double arrows are used to indicate the associated transition probability function 
is deterministic. 

By A3. 8, sufficient conditions for this are given by similar sufficient conditions (which 
are generally met in practice) for the existence of a lift in ^2.2 so either: (i) the a-algebra 
on 3^ is generated by countably many atoms; or (ii) y is an uncountable, complete, 
separable, metric space with TL the Borel a-algebra. Then for any transition probability 
function r(-|-) : TL x A —)■ [0,1] there is a deterministic transition probability function 
^('l') • X [0,1] —)• {0,1} such that 


t{B\x) = / ^{B\x,s) dX{s) 

2se[o,i] 

where A is Lebesgue measure on M. By A3.9, deterministic transition probability functions 
are induced by point transformations, which is another common interpretation of determin¬ 
ism. 

Of course, just because we can restrict to working with deterministic models does not 
mean we must or that we should-there are many arbitrary choices made in the construction 
of '^(■l') and it is more straightforward in general to simply work directly with t(-|-)- Hence, 


we will discard the restriction of determinism. (Similarly, in 15.1, we will show commonly 


employed restrictions for quantum models, such as unitarity, are equally without merit.) 
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Chapter 3 

Hidden classic and quantum nodes 


3.1 Principles of linearity and potential universality 


The traditional approach to using Bayesian networks with directed graphical models gives 
a marginal probability measure for each inputQ node and a transition probability function 
for each of the remaining nodes. Then the following principle 47 is utilized, which is so 


reasonable it almost always goes unmentioned, but is just implicitly assumed: 


Principle 3.1.1—Measurement independence The input marginal probabilities can be 
varied independently of each other and the transition probability functions]^ 

As an example of both the utility and reasonableness of this principle, consider a box with 
a horn and a switch. Suppose we know the horn buzzes with 0.9 probability if the switch 
is in the left position, and buzzes with 0.3 probability if the switch is in the right position. 
The graphical model for this is: 


r 


L 


■ — — — — — — — — — -i 

switch horn | 

•-^ box I 

. _ _ _ _ _ _ _ _ _ J 


(3.1) 


Now suppose we have a balanced stick that falls with equal probability to the left or the 
right when we release it. Now suppose we arrange it so it activates the switch when it falls. 
The graphical model now is: 


stick I switch horn | 

0 I ^ ^ box I 


^All nodes without parents. 

^Also known as the free choice or free will principle. 


(3.2) 
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The initial probability measure here is 0.5 to fall to the right and 0.5 to fall to the left. The 
action of the stick on the switch is deterministic, as is indicated by the double arrow. Once 
the switch has been activated, which we observe, the incoming probability measure will be 
either 1 for the switch to be right, 0 to be left or 0 for the switch to be right, 1 to be left, 
so we can reuse the transition probability function from above. Then, marginalizing the 
observations of which way the stick fell and the resulting position of the switch, there is a 
0.5 ■ 0.9 + 0.5 ■ 0.3 = 0.6 marginal probability the horn will sound. 

Now consider placing a screen that blocks our view of which way the stick falls, but still 
allows us to hear the horn if it sounds. The graphical model is 


stick 

(> 


switch 

-O 


horn 


box ' 


(3.3) 

If we wish to predict the marginal probability the horn will sound, we have a problem. We 
are given the behavior of the box for incoming probability measures either: 1 for the switch 
to be right, 0 to be left; or 0 for the switch to be right, 1 to be left-but not for 0.5 for the 
switch to be right, 0.5 to be left. This is where we must appeal to the preceding principle, 
which gives that there is still 0.5 ■ 0.9 + 0.5 ■ 0.3 = 0.6 probability the horn will sound. 

From the metaphysical viewpoint this principle makes perfect sense. If there are actually 
existing variables that take actual values (such as the position of the switch), observed 
outcomes depend only these and not on probability measures, which are only a reflection of 
our ignorance. As has already been commented on, this justihcation fails for hidden nodes, 
which are hypothetical constructs we introduce. Without this metaphysical backing, the 
principle is actually far stronger than what is required in that it assumes the existence of 
transition probability functions. 


If the principle is assumed to hold, then, from (2.15), the calculation of the joint prob 


ability reduces to some combination of composition and tensor product of of linear maps, 
involving both those induced by the given transition probability functions, restriction maps, 
and possibly the Copy map (the last two of which are also induced by certain transition 
probability functions-see 1|2.5[). By A3.2 and A3.3, this calculation is well-dehned. For 


instance, the calculation of the joint probability for (2.21), with 6'(-|-) and C('l') transition 
probability functions, can be given a^ 

(Ra o K o {{Rb o L) (g) /) o Copy o Rcu){A) (3.4) 


^We adopt the usual mathematical convention of maps acting on the left. The opposite convention of 
maps acting on the right is also commonly employed in the literature for the classical observed case 


62 


^We will follow the convention that the tensor product of vector spaces consists of all finite Imear 
combinations (the algebraic tensor product) except if both are Hilbert spaces, in which case it is the Hilbert 
space given by the completion using the standard induced inner-product. The tensor product K®Loi linear 
maps L : A ^ C and if : B —D is, for closed, linear spaces E C A 0 B and F C C 0 F, the set of all linear 
maps M : E -> F which, when restricted to A 0 B, agree with K ® L. If this set consists of a single map, 
K ® L is termed well-defined. 













CHAPTER 3. HIDDEN CLASSIC AND QUANTUM NODES 


19 


for L the map induced by 6*(-|-), K the map induced by C(-|-), / the identity map on measures, 
and Ra, Rb, Rc restriction maps. By introducing an initializing map, Lj, on the trivial 
measure spac^ which is isomorphic to M, with constant value fi and a terminal map, Lt, 
which evaluates the measure on X (hence, is a map to the trivial measure space), this can 
be written purely in terms of maps: 

Lt o Ra o K o {{Rb o L) ® I) o Copy o R(j o Lj (3.5) 


Later, we will introduce hidden nodes of a special form to account for initializing or termi¬ 
nating maps (see 14.1). 

Therefore, the joint probability is a multilinear functional on the input marginal prob¬ 
abilities. Furthermore, by a convergence theorem for sequences of measures 77 , it is not 


just linear for hnite linear combinations in each marginal measure, but absolutely convergent 
countable linear combinations as well. Hence, for any one input node, if <h is the functional 
(with all the marginal probability measures for other nodes fixed) and (/ij) is a sequence of 


marginal probability measures for that node with Ylj \\Lj\\ finite, then <F fijj = ^Lj- 
It is obvious that it would still be possible to make the calculation for the above example with 
the box based simply on this linearity property, which is weaker than the above principle. 

As we have already mentioned, we will consider hidden nodes associated to maps on 
spaces other than the space of measures. Therefore, we introduce the following weaker and 
more general version of the above principle: 


Principle 3.1.2—Linearity The maps for a Bayesian network are linear and bounded 
(hence, continuous). 


Moreover, we want the maps for graph fragments to be universal in the sense that the 
modules (such as the box in the preceding example) can be used to make predictions in novel 
situations. While this may fail in practice, this should be as a result of experiment and not 
be preordained by the mathematical models employed. We insist, therefore, on potential 
universality in the sense that any possible tensor product (not just those for a particular 
network) of the linear maps employed should always be well-defined. 

Principle 3.1.3—Potential universality The space of linear maps employed must be 
such that any tensor product of maps in the space is well-defined. 

The repercussions of the latter two principles will be studied in the following. 


measure on a set X is trivial if the only subsets in its tr-algebra are {0,X}. 
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3.2 Options I and II 


A problem arising from potential universality 

Generalizing from maps induced by transition probability functions to more general linear 
maps, composition is not an issue. However, the principle of potential universality does not 
hold in general for maps on measures on specihed sets for specihed a-algebras of events. One 
problem is the lack of uniqueness. Suppose one has the identity map / on Bore|^ measures 
on the interval [0,1] with the usual topology. Dehne I0l to be the set of all bounded, linear 
maps L on the Borel measures on [0,1] x [0,1], with the usual product topology, such that, 
restricted to product measures, p x i/, L(p x u) = (Jp) x (lu) = jjlx u. One obvious member 
of / ® / is simply the identity map on the Borel measures on [0,1] x [0,1]. This is the only 
weak*-continuou^map in the set. Another map in the set is given by A' : p ^ \/{p \\ (/iX z/)), 
where the supremum is taken over all product, hnite, Borel measures and p || (p x z/) is the 
part of p absolutely continuous with respect to p x u using Lebesgue decomposition [7^ . 
This is a well-dehned map by B2.1 that differs from the identity map. For instance, let p 
be the diagonal Lebesgue measure. 


p{A) = A ({a G [0,1] I (a, a) G A}) 


(3.6) 


Then the identity map sends p to itself, whereas Kp = 0 (see B1.8 for details). 

Another possible problem is the lack of existence. Not every linear map can be extended. 
For example, the space of sequences with limit zero, cq, is a norm-closecQ weak*-dens^ 
subset of the bounded sequences, loo- However, there is no extension of the identity map 
Co —t Co to a projection loo cq. 124| However, it is not clear whether there is a similar 
problem with extending the tensor product of linear maps. 


Resolving the problem—two options 


There are various ways to resolve this dilemma. One is to revert to only considering linear 
maps induced by transition probability functions, which works by A3.3. We will not pursue 
this approach since it has no ready generalization to linear maps on spaces other than those 
of measures. Instead, we will consider two approaches which do. The hrst, which will be 
termed option I, is to limit the space of measures so as to eliminate measures such as p 


®The Borel a-algebra on a topological space is that generated by the open subsets. 
^Using the Riesz theorem 


78 


which states that Radon (defined in the following) measures on a compact 
set are dual to the continuous functions on that set. A measure on a topological space is inner regular if 
the measure of any set is approximated by the measure of compact sets it contains. It is outer regular if the 
measure of any set is approximated by the open sets that contain it. A measure is Radon if it is Borel and 
inner regular. If the space is compact and Hausdorff, and the measure is finite, then it is also necessarily 
outer regular. If the space is compact, Hausdorff, metric, and separable, then Borel measures are necessarily 
Radon 


79 


^Usmg the supremum norm. 
®Using the duality of £i and G 
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above. The alternative, which will be termed option II, is to impose additional structure on 
the sets and then to limit the space of maps, so as to eliminate maps such as K above. 

To implement option I, a basic property that we want for our subsets of measures is 
dehned by: 


Definition 3.2.1 A subset A of measures is absolutely-continuous-complete if, for any p, 
in A, all measures absolutely continuous with respect to p are also in A. 

By the Radon-Nikodym theorem and the density of simple function^ in L^-spaces, a subset 
of finite measures A will have this property if, given any measure p G A, all the restric¬ 
tions p\e over /r-measurable subsets E are also in A. Hence, this property is the minimal 
requirement to show that a map is indeed positive. Then from A1.2, A1.3, B1.3, B1.4, 
and B2.5, the necessary and sufficient condition to implement option I is that the tensor 
product of the subsets of measures for each set is norm-dens^^in the subset of measures for 
the product set. 

A sufficient method, termed option I’, to insure this is met is for each set to have an 
associated base measure. The base measure need not be hnite, but it must be cr-finite. Then 
the associated subset of measures is the set of all hnite measures absolutely continuous with 
respect to the base measure, which by the Radon-Nikodym theorem is equivalent to the space 
of Lhfunctions with respect to the base measure. The base measure for the direct product 
of sets must be the product of the base measures for the individual sets. The sufficiency of 
this prescription is given by B3.1. 

Another sufficient method is to instead use the space of atomic measures^ If there are 
uncountably many atoms in the a-algebra, this is distinct from option I’; otherwise, the 
counting measure that assigns one to each atom is a base measure. The sufficiency of this 
prescription is given by B6.1 and B6.2. It is also possible to combine these two sufficient 
approaches, say by using the atomic, L^(A’;/r)-valued vector T-measures on 3^. This is 
sufficient by B3.1, B6.1, and B6.2. 

The implementation of option II is more straightforward. Each set has a topological 
structure that makes it a compact, Hausdorff space. By Tychonoff’s theorem 53 ISOl, the 


direct product of compact spaces with the product topology is necessarily compact. All the 
cr-algebras are required to be the Borel cr-algebra. The linear maps are restricted to those 
that are weak* continuous; in other words, those maps that are the adjoints to linear maps 
on continuous functions going in the opposite direction. In practice, rather that working with 
the adjoint maps, one works with the linear maps in the opposite direction. The propositions 
A1.3, Cl.l, and C2.4 gives the sufficiency of this prescription. 


simple function takes on only finitely many values. 

^^Using the total-variation norm. 

measure is atomic if there is a union of countably many atoms in the cr-algebra such that the 
complement of the union has measure zero. 
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Comments on the two options 

For option F, the need for base measnres is not generally a tronblesome issne. For cx-algebras 
generated by a countable number of atomic subset^^ the counting measure that assigns one 
to each atom is a base measure for any hnite measure. For classical physics, with conhgura- 
tion space A and phase-space given by the cotangent bundle T*A, the symplectic phase-space 
volume- formp^ D provides a natural base measure, since by Heisenberg’s uncertainty princi¬ 
ple, not even all measures absolutely-continuous with respect to H are accessible, let alone 
more singular measures. 

Note that allowing the base measure to be a cr-hnite measure is only for purposes of 
convenience in allowing the commonly employed Lebesgue measure on unbounded subsets 
of M"', as the following theorem shows: 


Theorem 3.2.2 Given any cr-hnite measure /x on a set A, there is a hnite measure u such 
that L^{A; fi) is isometric to L^{A; u), where the isomorphism is a pointwise scaling. 


Proof If /i is hnite, there is nothing to show, so assume it is not. Since /i is cr-hnite, 
there is a countable collection of disjoint subsets {Bj} of A such that is hnite and 

nonzero for each j G {1, 2,...} and U^i Then let the hnite measure u be given by 

I n 

However, there are two complaints with option I’. One is that for base measures with an 
inhnitely-divisibl^^ part (see B1.7), there is no Copy map (by the same argument as in the 
proof of B1.8); hence, except for this (ehectively discrete) case, hidden nodes either have 
only one child node or are terminated. The second complaint is that passing a continuously 
variable parameter to a hidden node as a simple number is not permitted (unless that 
particular value corresponds to an atom in the base measure); instead, one must use a 
sharply peaked measure. This adds signihcant complexity for little gain in cases where 


one is not especially interested in modeling uncertainty in the inputs (see ^4.5 and 17.2 for 
instances). 

For option II, the restriction to topological spaces and Borel a-algebras is also not trou¬ 
blesome, since these are typically used in any case. The limitation of using compact spaces 
appears severe, but locally compact spaces^® can also be used with the restriction that the 


subset in a cr-algebra is atomic if it is indivisible in the cr-algebra. 

^“^As a measure, locally H is simply Lebesgue measure with respect to any canonical choice of local position 
and momentum coordinates. 

^®A CT-finite measure fj, for a measure space is infinitely-divisible if for any e > 0, there is a 

countable (finite if fj, is finite) partition of A, {Bj}, with each < e. 

16 a 

space is locally compact if it can be compactified by the addition of one point, the point-at- 


infinity. 54 . 81 
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maps take continuous functions vanishing at infinity to continuous functions vanishing at 
inhnity, so the adjoint maps on measures do not “leak away” measure at infinity. 

In addition, for option II, the first complaint above does not occur since the Copy map is 
adjoint to the map Copy^ that takes continuous functions on A" x A" to continuous functions 
on A by (Copy^/)(a;) = f(x,x) (which can obviously be generalized to make any hnite 
number of copies). The second complaint does not occur either since the evaluation map is 
well-dehned for continuous functions. However, there is now the opposite problem in that we 
wish to calculate probabilities on sets, so we need maps on characteristic functions, not just 
continuous ones. One solution is to extend each map to one from bounded, Borel measurable 
functions to bounded, Borel measurable functions; by C2.9 this can always be done in a 
unique manner. Another solution is to use the results on continuous functions to get the 
result for characteristic functions of open sets as in the proof of the Riesz theorem 78 ; then 
outer regularity gives the result on any characteristic function of a Borel set. 

Note that for option I’, the considered linear map L : — )■ L^(y;iy) is al¬ 
ways induced by a conditional probability which is given by, for i/-measurable sets 

B, e(H|-) = L*1b (which is of course actually an equivalence class of functions that agree 
almost everywhere with respect to p) with the adjoint map L* : L°°(y; u) —)■ L°°(A; p): 


{Lf){y)du{y)= lB{Lf){y)du{y)= f{x){L*lB){x)dn{x) (3.7) 


lyeB 


ly&y 


'xex 


f{x)^{B\x) dfi{x) 


Ix&X 


for any / G T^(A; /i). By ASA, if either: (i) the a-algebra for 3^ is generated by countably 
many atoms; or (u) 3^ is an uncountable, complete, separable, metric space with the Borel a- 
algebra-then a transition probability function r(-|-) does exist that is a lift of .^(■l-). Similarly, 
if we use option I with the measures limited to the atomic measures, the considered linear 
map L : A{X]S) —)■ A{y; IF) is always induced by an object r(-|-) which is given by, for sets 
B E y, t{B\x) = {LSa){B) for x in the atomic set A E S: 


{Lfi){B) = / T{B\x)dfi{x) (3.8) 

Jx&X 

for any atomic measure p G A{A',S). However, in general r(-|-) will not be a transition 
probability function since there is no reason for t{B\-) to necessarily be £^-measurable. Also, 
for option II, if 3^ is metric, by C2.8, for the considered linear map L : C(3^) —)■ C(A), the 
adjoint map L* : Ad (A) — )■ M.{y) is induced by the transition probability function r(-|-) 
given by t{B\x) = {LSx){B) for any x G A and Borel subset B C y: 

{L*y){B) = f t{B\x) dy,{x) (3.9) 

Jxex 

for any Radon measure /i G Ad (A). However, despite the existence of transition probability 
functions (or similar objects) in most encountered cases, it is still more fruitful to consider 



CHAPTER 3. HIDDEN CLASSIC AND QUANTUM NODES 


24 


the linear maps themselves as the primary objects of interest rather than the transition 
probability functions (or similar objects). As is shown in the following section, the linear 
maps can be generalized to linear maps on structures other than measures, whereas the 
transition probability functions or similar objects do not generalize. 


3.3 Quantum nodes 


Expanding the space of considered maps 


L8 


As an alternative to linear maps on measures, consider linear maps on density matrices 
or, more generally, density matrix-valued vector measures. We will show below that, with 
some restrictions on the maps, this can be made to work consistently with the propositions 


given in 13.1 We will show in 15.1 that this gives rise to models that are consistent with 


the usual, textbook quantum mechanics; therefore, nodes whose linear maps involve density 
matrices will be termed quantum. However, the term quantum should not be taken to imply 
these maps are necessarily only of utility in situations traditionally thought of as in the 
domain of quantum mechanics-in principle, any hidden node in any forecasting situation, 
say forecasting weather or stock prices, could be a quantum node. It is up to experiment to 
determine if these are of utility. Thus far, we are unaware of any structures besides measures 
and density matrices that have the requisite properties to be employed in modelling. The 
question of whether or not there are such additional structures will be further explored in 
below. 

Following our work in the preceding section, for option I, instead of linear maps on subsets 
of real-valued measures, we have linear maps on subsets of the ©(H)'*'-valued vector measures. 
For option I’, we will require the vector measures to be absolutely continuous with respect 
to a base measure. Since T>(H) has the Radon-Nikodym property 18 , this is equivalent to 


having P(H)+-valued, Bochner-integrable functions. The constraint of potential universality 
mandates having well-dehned tensor product of maps; this is maintained by A1.3 and B4.1 
if the tensor product is bounded. 

Similarly to the classical case, another sufficient approach to implementing option I is to 
take the atomic, P(H)'^-valued vector measures, as is shown by B6.1 and B6.2. It is also 
possible to combine these two sufficient approaches, say by using the atomic, L^{X; /i; T>(H))- 
valued vector £^-measures on 3^. This is sufficient by B4.1, B6.1, and B6.2. 


Hilbert space will be taken to be any complete, sesquilinear inner-product space, without regard to 
cardinality of dimension or separability. The density matrices will be taken to be the self-adjoint, 

positive operators on a given Hilbert space H. 

linear map from 1?(H) to 'D(J) is commonly referred to as a superoperator in the literature. We choose 
not to employ this terminology for the following reasons: (i) linear maps is already standard mathematical 
terminology and is in common use in the analogous classical situation, for instance Markov maps; (ii) 
superoperator seems to imply a map on all bounded operators, B{V\), when in general it is not possible to 
extend the domain of the map beyond the trace-class operators, 5i(H); and (in) the use of super- risks 
confusion with the unrelated supersymmetry and superstrings. 
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For option II, instead of maps on real-valued, positive, continuous functions, one has maps 
on continuous functions that take values in the self-adjoint, positive, compacij^ operators on 
a Hilbert space H, /C(H) + . Self-adjoint, compact operators are used since they are the predual 
to the self-adjoint trace-class operators. 65 Once again, potential universality mandates 


having well-dehned tensor products of maps; as before, this is maintained if the tensor 
product is bounded, now by A1.3 and C3.1. 


Problem arising from positivity and potential universality 

Of course, everything is not really that simple. The tensor product of bounded maps may 
not be bounded. Also, even if both maps are positive, their tensor product need not be. 
To illustrate these problems, take a separable, inhnite-dimensional Hilbert space H, £x some 
orthonormal basis {ej}, and consider the transpose map T relative to that basis: 



(3.10) 


where is the functional (-, 6 ^). This is well-dehned on the space of density-matrices on 
H, P(H)+, since, by the spectral theorem for compact operators (^, any such operator 
can be written in the form of an infinite matrix with hnite rank operators corresponding 
to truncated matrices converging in trace norm. (Of course, since density-matrices are self- 
adjoint, T could also be termed the conjugate map relative to the basis). This map is clearly 
positive and has operator norm one. 

However, consider the tensor product map T®Ij^^ acting on density matrices , 

where Im^ is the identity map acting on n x n-matrices. For the rank-one operator 'ip 
with 


n—1 




'(n+l)j+l 


(3.11) 


1=0 


we have (T (g) /_a 4„)('0 ® 'ip*) = S. Truncating S to the span of {ei,... , 6 ^ 2 } (it is zero 
elsewhere), it is the matrix form of the transpose map acting on Ain written in vector form 
using the Vec operatiorQ Therefore, S clearly has eigenvalue one with multiplicity U'n-+L 
and eigenvalue minus one with multiplicity _ Hence, T Imu is not positive, and 


\\T®I. 


Mn l|op ^ 


n{n + 1 ) 1^1 n.(n — 1 ) 


— 1 | 1 = — = n 
n 


(3.12) 


By \\T®Ij^^ Hop = n and this example is maximal. This is clearly unbounded as n —)■ cx). 

operator is compact if the image of any bounded sequence has a convergent subsequence. 

^°Vec takes a n x n-matrix to a column vector of height by stacking columns. 
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Solution to the problem 

The solution to both the positivity and the boundedness problem is to require complete- 
positivity for the maps. This has several dehnitions (see B2.6, B5.6, C2.1) that are equiv¬ 
alent (see B5.8, C5.6); the basic notion is that all tensor products with various identity 
maps should be positive. Since the composition of positive maps is positive, this immediately 
implies that complete-positivity is preserved under both composition and tensor products. 
Furthermore, from B2.5, B2.5, C2.4, and C5.3, the operator norm is a cross-normp^ for 
completely-positive maps, so this resolves the boundedness problem as well. The completely- 
positive maps clearly form a convex cone within the space of all maps; this cone is closed in 
the norm topology and in various weaker topologies by B5.12, B5.14, and C5.10; however, 
unlike the cone of positive maps, in infinite dimensions it has no interior in any of these 
topologies, which raises issues for approximation in numerical calculation. 


3.4 No quantum copying 

It is commonly stated that cloning is something that is possible classically, but is impossible 
in quantum mechanics. This is based on false analogy. The correct situation is that there 
are two different notions, that of copying and that of cloning, that are being confused. Once 
these two have been separated, we have the following situation: 

classical quantum 


copying 


cloning 


Exists and implementable 
since linear. 

Exists but not imple¬ 
mentable since neither 
linear nor the ratio of linear 
maps. 


Does not exist. 

Exists but not imple¬ 
mentable since neither 
linear nor the ratio of linear 
maps. 


The confusion is comparing the upper left and lower right entries instead of correctly going 
across. Cloning is possible for neither classical nor quantum Bayesian networks (as will be 


shown below in 16.1) for exactly the same reason, so it does not differentiate the two. On the 


other hand, copying is possible classically (except for issues arising from potential universality 


considered above in 13.2), but cannot even be defined as a mathematical operation on density 
matrices. 


norm is a cross-norm if ||a 0 b|| < ||a||||b||. Note the property of being a cross-norm depends on the 
choice of norms for the individual spaces as well as for the larger space containing the tensor products. 
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Classical copying 


As has already been mentioned (see |:2.5), there is a Copy map from measnres on a set X 
to measnres on X x X. When A is a compact set, this map is weak*-continuons, being the 
adjoint of the previously discussed map (Copy^/)(a;) = f{x,x). The Clone map is given 
by /i —)■ /r X /i. For the single atom measure for atom C, where for any measurable subset 
Ac X, 

^ ' I ii Ac C , , 

^c(^) = ^ , . (3.13) 


we have 


0 otherwise 


Copy Sc = Sc X Sc = Clone Sc 


(3.14) 


This is likely the source of confusion between the Copy and Clone maps for the classical 
case. 

Instead of using this explicit form for Copy, an approach that will prove useful in the 
quantum case is to start with some basic properties, then hnd the implications. One property 
of what is commonly accepted as the notion of a copy is that the probability for both copies 
to have a specihed property is equal to that for each copy to have it, which is equal to that 
of the original having it, so for any unit-norm measure fi on X and any /x-measurable set 
Ac X, 


Property C Copy fj.(A x A) = Copy /i(A x X) = Copy ft(X x A) = /i(A) 

Note this property implies Copy /x(A x (A \ A))) = Copy /i((A’ \ A) x A) = 0. Now given 
a (j-algebra S of subsets of X, let IF be the cx-algebra generated by the rectangular subsets 
£ X £. Then we have the following: 


Theorem 3.4.1 Any map L from unit-norm £^-measures on X to unit-norm J^-measures 
on X X X obeying property C is linear for convex linear combinations. 


Comment Since any hnite measure can be scaled to have unit-norm, this implies the 
map can be extended to all hnite measures, with the extended map being positively linear. 
By the generating property of measures among signed measures as a result of Jordan de¬ 
composition 1^, this implies the map can further be extended to a linear map on signed 
measures. 


Proof Let L be such a map and /i an unit-norm ^-measure on X. For any subsets A,B C £, 
by the properties of measures, 

(L/x)(A X B) ={Lij,){{A n 5) X (A n B)) + n 5) x (5 \ (A n B))) (3.15) 

+ (L^)((A\B)y.B) 
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However, (AnB) x (B\(AnB)) C (AnB) x (A’\(HnH)) and (A\B)xB C (A\B)xB, 
so, by property C and its implication, 

(Lfi)(A xB) = {Lfi){{A nB)x{An B)) = /i(H O B) = {Lfi){B x A) (3.16) 

Let p be another unit-norm £^-measure on X. Then for any t G [0,1], (1 — t)p + tp, will 
be a unit-norm T-measure on X. Consider the signed measure on X x X given by 

ut = L{{1 — t)p + tp) — (1 — t)Lp — tLp (3-17) 

Take any A & S. Then, by property C, x H) = 0. However, by the above symmetry 
property of L, 

ut{A X B) = ^ {ut{A X B) + vtiB x H)) (3.18) 

which is equal to 

\ (z/i((H U H) X (H U B)) - utiiA \B)x{A\ B)) (3.19) 

\A)x{B\ H)) + z/,((H nB)x{An B))) 

which is zero by the preceding property of z/^. Since A, B were arbitrary, Vt must be the zero 
measure. □ 

Quantum Copying 

The Clone map taking a density matrix on the Hilbert space H to one on H ® H is defined 
as p —)■ p 0 p. How to dehne a Copy map is not obvious. By analogy to the classical case, it 
should have the following properties for any unit-trace density matrix p and any projector 
E: 

Property Q ti {E ® E) Copy p = tr {E ® Iy\) Copy p = tr (Jh ® E) Copy p = tr Ep 

Note this implies tr [E ® (Jh — E)) Copy p = tr ((/h — E) ® E) Copy p = 0. Then we have 
the following: 

Theorem 3.4.2 Any map L from unit-trace density matrices on H to unit-trace density 
matrices on H ® H obeying property Q is linear for convex linear combinations. 

Comment Since any density matrix can be scaled to have trace one, this implies the map 
can be extended to all density matrices with the extended map being positively linear. By 
the generating property of density matrices among signed density matrices as a result of the 
spectral theorem for compact operators, this implies the map can further be extended to a 
linear map. 
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Proof Let L be such a map and p an unit-trace density matrix on H. For any commuting 
projectors E, F, 

tr {E (g) F){Lp) =tr (EF 0 EF){Lp) (3.20) 

+ tr {EF O (F - EF)){Lp) + tr {{E - EF) ® F){Lp) 

However, by positivity and the implication of Q, 

0 < tr {EF O (F - EF)){Lp) < tr {EF ® (Jh - EF)){Lp) = 0 (3.21) 

and 

0 < tr ((F — EF) (g) F){Lp) < tr ((/h — F) (g) F){Lp) = 0 (3.22) 

so, using Q, 

tr (F ® F){Lp) = tr {EF (g) EF){Lp) = tr EFp = tr (F 0 E){Lp) (3.23) 

Let r be another unit-trace density matrix on H. Then for any t G [0,1], {l — t)p + tT will 

be a unit-trace density matrix on H. Consider the signed density matrix on H 0 H given by 

vt = L((l — t)p + tr) — (1 — t)Lp — tLr (3.24) 

Take any projector E. Then, by property Q, tr (F 0 E)i>t = 0. However, by the above 

symmetry property of L, for any commuting projectors F, F, 

tr (F 0 F)i>t = ^ (tr (F 0 F)i>t -|- tr (F 0 E)i>t) (3.25) 


which is equal to 

^ (tr ((F + F - EF) 0 (F -f F - EF))vt - tr ((F - EF) 0 (F - EF))vt (3.26) 
—tr ((F — EF) 0 (F — EF))vt + tr {EF 0 EF)vt) 

which is zero by the preceding property of Vf Since F, F were arbitrary, z/* must be the zero 
operator. □ 

We then have the following theorem, based on the argument of Wooters and Zurek [126| that 
density matrices of rank greater than one can be expressed in more than one way (inhnitely 
many ways actually) to create a contradiction. 


Theorem 3.4.3 


There is no quantum Copy map for non-trivia! 


22 


H. 


trivial Hilbert space has dimension one. 
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Proof Take any orthonormal {u, v} C H with corresponding adjoint operators u* = (•, u)h 
and V* = (•, v)h. Consider p = |(u®u*+v® v*). By linearity, 

Copy P = ^ ( Copy (u (g) u*) + Copy (v (g) v*)) (3.27) 

By property Q, for rank one density matrices Copy must be the same as Clone, so Copy p 
is uniquely given as 

-(u(g)u(g)u*(g)u* + v(g)v®v*(g) V*) (3.28) 

However, it is also possible to write p as 

^ ((u + v) (g) (u + v)* + (u - v) (g) (u - v)*) (3.29) 

Then Copy p is uniquely given as 

^ ((u + v) g) (u + v) g) (u + v)* g (u + v)* (3.30) 

O 

+ (u - v) ® (u - v) g (u - v)* g (u - v)*) 


= -(ugugu*gu* + ugugv*gv* + ugvgu*gv* 

4 

+ ugvgv*gu*+vgugu*gv*+vgugv*g u*+ 

+V g V g U* g U* + V g V g V* g V*) 


Clearly, (3.28) and (3.31) are unequal, which is a contradiction. □ 


(3.31) 


3.5 Embedding quantum models into classical ones 

A construction for option I’ using atomic measures 

For option I’, there is a way to embed quantum behavior into a purely classical, but con¬ 
textual, model. For any Hilbert space H, let §h be the closed unit ball within H. Let §h/ ~ 
be the quotient set formed from Sh by the equivalence relation -0 ~ if there is a phas^^ 
w such that 'ip = wQ Clearly Sh/ ~ is in one-to-one correspondence to rank-one projectors 
on H. Let S be any cx-algebra on Sh/ ~ such that all points are atoms (such as the Borel 
(j-algebra). For any set X and base measure p denote the space of atomic, hnite-norm, 
p)-valued vector £^-measures on Sh/ ~ by ^(Sh/ S; p)). This is a Banach 

space by B6.1. Given r G ^(Sh/ T;/i)), for any /i-measurable subset B C X, 
dehne tb to be the atomic, signed T-measure on Sh/ ~ dehned by tb{A) = f^T(A)dp for 


^^Elements of C with magnitude one. 
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any A G T. Let be the equivalence relation on A (Sh/ ju)) given by r y if, 

for any /x-measurable subset B G X, 

/ ss* drsis) = / ss* dxB{s) (3.32) 

Since the equivalence class using of zero is a closed, linear subspace, the quotient space 
A (Sh/ fi)) / ~' is a Banach space using the standard norm for quotient spaces, 

||[r]|| = inf^g[T-] ||x||. Dehne the positive cone on the quotient space to be those equivalence 
classes with a positive member, using the obvious notion of positivity on A (Sh/ rNj ■ 

Then we have the following theorem: 

Theorem 3.5.1 There is a positive, linear, isometric isomorphism, 

L‘ (X-, K ®(H)) £; L\X-, ^)) / -' 

Proof If the measure fi is trivial, then [X] = T>(H). Define the map T ; 

(T;/i; 11(H)) —)-^(Sh/ ^;S; L^{X; jj,)) / ~'byT(p) being the equivalence class of 
for p = Ylij with countable collections {'ipj} G Sh and {aj} C M, which is always pos¬ 

sible by the spectral theorem for compact operators. 

For more general measures on X, hrst start with the observation that, given any p G 
{X; /i; 11(H)), each p{x) lives in the same separable subspace of H for almost every x with 
respect to p, namely the subspace G that the operator |p(a^)| dp G 'Zl(H) lives in. Let {oj} 
be an orthonormal basis for G. For each j G {1,2,...}, let Pj be the orthogonal projector 
onto the span of (ei, e 2 , ..., e^}. Since simple functions (more properly, the “functions” 
which are equivalence classes containing simple functions) are norm-dense in {X; p; ©(H)), 
the sequence (PjpPj) is a Cauchy sequence by C4.3; hence, it converges in norm by the 
completeness of the Banach space [X] p]D{H)). It is readily seen that the limit point is 
p. Now dehne the map T : [X] p]D{H)) —> ^(Sh/ L^{X] p)) / by hrst dehning 

'F('^'^*/), for G H and / G L^{X-,p) to be the equivalence class of the vector measure 
/ ® S[p]. Since the linear space ©(P^H) is spanned by the operators 

{®j6jC {(gj -|- e/j)(ej -|- gQ , (e^ -|- iGQi^Gj igQ (3.33) 

the map T can be extended by linearity to all p with the property that p{x) lives on the 
same hnite-dimensional subspace of H for almost every x with respect to p. 

Now take any p with this property which is also a simple function. Then, applying the 
(hnite-dimensional) spectral theorem to each of the hnitely many values p takes, T is readily 
seen to be a positive isometry. Since simple functions are norm-dense, by A1.2, T is a 
positive isometry on all p with the property that p{x) lives on the same hnite-dimensional 
subspace of H for almost every x with respect to p. However, by an above argument, such p 
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are norm-dense in (A; /i; D(H)). Hence, using Al. 3 to extend \1/ to all of (A; /x; ©(H)), 
d' is a positive isometry. It remains to show it is surjective, but that is easily seen, with 


^-1 








(3.34) 


for any Y.jfj ^ ^ h^(A;/i)). □ 


Comment Since Sh xSj ^ Sh^j if neither H nor J are trivial, this construction is necessarily 
contextual. Therefore, it does not violate Bells’ inequality (see ^6.3| below). 


The nonexistence of the corresponding construction using a base 
measure 

The corresponding construction using a base measure u would be for there to be a positive 
isometry from L^(A;/x; T>(H)) to a quotient space of some L^{y]u). This is impossible, as 
the following theorem shows: 


Theorem 3.5.2 If the Hilbert space H is non-trivial, there are no: (i) set 3^; (ii) a- 
hnite measure z/; and (iii) equivalence relation ~ induced by a closed, linear subspace 
B C L^(3^; z/)-such that there is a positive, linear isomorphism, 4/ : (A;/x; D(H)) — 

which is also an isometry on the positive cone. 


Proof Suppose otherwise. Then there is a (A; /x;'D(H))-valued vector measure r on 3^ 
provided by r(A) = ([1 a]) for any z/-measurable subset A C 3^. The spaces T^(3^; u)* = 

by Riesz’s theorem [^. The dual to L^(3^;z/)/ ~ is provided by the annihilator 
B-*“: the closed, linear subspace of L°°{y-^v) that annihilates B. (Note, in particular that 
since T is an isometry on the positive cone, the constant “function” [ly] G B-*“.) Therefore, 
there is an adjoint map T* : B-*“ —(A; /x;'D(H))* given by 


f'Updu = (T*/)p 


(3.35) 


'y 


for any / G B-*- and p G (A; /x; V{H)). By the basic properties of Banach spaces, (T*)“^ = 
(vl/-i)*, vl>* is positive, and T* is an isometry on the positive cone. 

Then, 


= <h(r(A)) 


(3.36) 


for any linear functional <h G (A;/x; P(H))* and //-measurable subset A C 3^. For Jh 
the identity operator, T*“^(/h1a’) is the element of B-*“ that agrees with the norm when 
integrated with any positive “function” in L^(3^;z/)/ hence, it must be ly. Therefore, 


tr r(A) d/x = / T* {ly^lx) dv = v{A) 


' X 


(3.37) 
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However, this gives rise to a contradiction. Fix some subset B C A with 0 < /u(B) < oo. 
Take any unit norm G H. Since \1/ is positive, by dehnition, the equivalence class 
has a positive member, call it g^. There must be some V' 7^ ■C such that A = {g^ > 
0 } {g^ > 0} has strictly positive //-measure; otherwise, there would be an uncountable 

collection {g^ > 0} indexed by unit norm -0 G H of subsets of 3^, each with strictly positive // 
measure, but whose pairwise intersections all have //-measure zero. The existence of such a 
collection would contradict // being cx-hnite by B1.6. Since T is an isometry on the positive 
cone. 


[ g^du = ij,{B) ti I iji/j* = H{B) = jj,{B) ti= [ g^{y) (3.38) 

Jy Jy&y 

Hence, (r-^, '^) < tr r = // must be equal to // when restricted to {g^ > 0} D H. By a 
similar argument, must must be equal to // when restricted to {g^ > 0} D H. These 

conditions are impossible to satisfy. □ 

The special case of two-dimensional Hilbert spaces 

It is possible to circumvent the conclusion of the preceding theorem if the positive iso¬ 
morphism is with a closed, linear subspace of L^(3^;//)/ ~. Specialize to y = A x 2 
and f = g. X g and let C be the closed, linear subspace of L^{A x Z]g x g)/ ~. Using 
the notation of the preceding proof, let be the equivalence relation on the annihilator 
B-L X Z] g X g) induced by the annihilator C-*- C L°°{A x Z; g x g), so f g it 

J^^^[h]f d{g X g) = J^^^[h]g d{g x g) for all [h] G L^{A x Z;gx g)/ ~. From the proof 
of the preceding theorem, it is then necessary that A sufficient way 

to accomplish this would be for there to be a positive map (not necessarily linear in 'ip'ip*) 
A : Sh/ ~phase—t L°°{Z;g), where ~phase is the equivalence relation on Sh given above, such 
that 

= [lx Z Hi>)] for all -0 e Sh/ ~phase (3.39) 

and, for almost every z & Z with respect to g, the map V’ A{'ip){z) is a frame function 
with weight on^^ 

For H of dimension two, we then have the following based on a construction by Kochen 
and Specker [^. Let u be the usual measure on the sphere S^ C so, with Cartesian 
coordinates (xi, 0 : 2 , 0 : 3 ) for 

u:{A) = / = {xi dx2 A dx^ — X2dxi A dxs + x^dxi A dx2) (3.40) 

J {xi,X2,X3)eA 

for any Borel subset A C S^. Let ~ be the equivalence relation on L^{A x S^; g x uj) given 
by / ~ ^ if X uj) = J^^jjgd{g x 00 ) for all /i-measurable subsets B C A and all 

function / : Sh/ ~phase~^ K is a frame function with weight w if it is zero except for a separable 
subspace of H and /(®i) = ^ orthonormal basis {e^} of that subspace. 



CHAPTER 3. HIDDEN CLASSIC AND QUANTUM NODES 


34 


hemispheres H C (whether the hemispheres are taken open or closed is irrelevant). Equip 
the quotient space L^(A xS'^; fixu)/ ~ with a norm in the usual way via || [/]|| = infgg[/] || 5 f||. 
Dehne the positive cone on L^(A x §^;/i x a;)/ ~ by those equivalence classes that contain 
a positive element of L^(A x ju x cj). 


Theorem 3.5.3 There is a positive, linear isometry, 4/ : (A; —>■ L^(A x §^;/i x 

u)/ with an associated map A : §h/ ~phase—t with the above properties. 


Proof Take any orthonormal basis {ei, 02 } for H = C^. Let T : (T; /i; T>(H)) —^L^(Ax 

X u)/ ~ be given by hrst taking T on elements of the form '0V’*/ '0 ^ Sh and 

/ E L^(A; /u) to be the equivalence class of the positive “function” 


/■ 



if ys > 0 
otherwise 


for y = [ 1-010 - 1020 3^20102 S 20102 ] and s = 


51 

52 

53 


e 


(3.41) 


where 0j = (0,e0. Since H is two-dimensional, T>(H) is four-dimensional, with basis 
{ 61600262 , (oi -|- 62 )( 6 i -|- 62 )*, (oi -f- * 62 )( 6 i -|- ^ 62 )*}. Therefore, T can be extended by 
linearity to all of (T;/r; ©(H)). 

For p a simple function, by using the spectral theorem for the each of the hnite num¬ 
ber of values p attains, the map T is readily seen to be positive and an isometry. Since 
simple functions (more properly, the “functions” containing simple functions) are dense in 

(T; p; T>(H)), by A1.2, T is a positive isometry for all of (T; /i; T>(H)). 

For this T, an associated A does exist. It is given by hrst dehning 


“ I/ 2 P 3f^2^i6 A2^i6 ] ^ / oc y^^^ei +-^====62 (3.42) 

where the proportionality for ^ is up to an irrelevant phase. Then A(,^) = , where H^ is 

the hemisphere centered at This has the required properties since (i) for any orthonormal 
(00, A(0+A(C) = = ls 2 , with equality in the L°°(S^;a;)-sense of almost 

everywhere with respect to uj and (ii) for any 00 G §h and / G L^{A; p), 

(Ia' ® A(0)T(00*/)d(/x xu}) = ^(1 + yz) = 1(^)0)!^ ° 


However, for Hilbert space H of dimension greater than two, there is no construction of this 
form because Gleason proved that all frame functions on Hilbert spaces of dimension 
greater than two are regulaip^ 


frame function is regular if it is given by 0 —>■ {Tip, ip) for some trace-class, self-adjoint operator T. 
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Theorem 3.5.4 There are no: (i) set Z; (ii) cr-finite measure r]; and (iii) equivalence 
relation ~ induced by a closed, linear subspace B C L^{A x Z;fi x ri)-such that there is a 
positive, linear isometry, T : (T; p;P(H)) —>■ L^{X xZ\ fixrj)/ which has an associated 

map A : Sh/ ~phase— t L°°{Z] rj) with the properties given above. 


Proof Suppose otherwise. By Gleason’s result, there is a T G 77 ; T>(H)+) such that 

T{z) has trace one for almost every z & Z with respect to r] and 

[ ( 1 ;, ® (Te, 0) d{fx Xr]) = \ (e, rA(5) (3.43) 

JxxZ 

for all G Sh/ ~phase and p-measurable B <Z X. However, then following the argument of 
theorem 3.5.2, there is a contradiction. Fix some subset B <Z X with 0 < ^i{B) < 00. Take 
any unit norm -0 G H. Since T is positive, by dehnition, the equivalence class T('0'0*[l5]) has 
a positive member, call it g^. There must be some 7 / 7^ ■C such that A = {g^ > 0} H {g^ > 0} 
has strictly positive i/-measure (Since g'ii}-ig^ are “functions”, A is actually an equivalence 
class of sets that differ by sets of //-measure zero). Since T is an isometry on the positive 
cone. 


'XxZ 


g^ X g) = fi{B) tr = T{d3) = fi{B) tr 


(3.44) 


(lx Z (T'lp.'il))) g^d(g, x g) 


' XxZ 


Hence, QT'ip^'ip) < tr T = 1 must be equal to one almost everywhere with respect to g when 
restricted to {g^ > 0} D A (once again, this is actually an equivalence class of sets that 
differ by sets of //-measure zero). By a similar argument, (T^,^) < tr T = 1 must be equal 
to one almost everywhere with respect to g when restricted to {g^ > 0} D A (also actually 
an equivalence class of sets that differ by sets of //-measure zero). These conditions are 
impossible to satisfy. □ 


A construction for option II 


For option II, there is also a way to embed quantum behavior into a purely classical, 
but contextual, model. For any Hilbert space H, let Bh be the closed unit ball within 
H. Equip Bh with the weak topology; denote the resulting space by Since Hilbert 

spaces are reflexive, with H* = H by Riesz’s theorem 84 , then this is the same as the 


weak* topology, so by Alaoglu’s theorem 85 , is a compact space. Furthermore, it is 

Hausdorff since, given any distinct points bo,bi, there are separating weak neighborhoods 
Af {bo;bi - bo] l\\bo - biW^) and Af {bi]bi - bo] l\\bo - 6i||^). 

Then, for any compact, Hausdorff space X, we have the following theorem: 


Theorem 3.5.5 There is a positive, isometric isomorphism between C(A’;/C(H)) and a 
closed, linear subspace of C (A x . 




CHAPTER 3. HIDDEN CLASSIC AND QUANTUM NODES 


36 


Proof Consider the map \1/ :C(A’;/C(H)) C (A x given by 

= ((p(x)b,b} (3.45) 

The map 4/ is clearly both positive and an isometry. From the latter property, its image is 
a closed, linear snbspace. The fnnctions in the image of the map are also clearly continnous 
in x; the following lemma shows that they are continuous in b, so, as claimed above, they 
are indeed continuous functions. □ 


Lemma 3.5.6 For any compact operator 0 G /C(H), the map <F : —)■ M given by 

<h(6) = (05,6) is continuous. 


Proof First take the case of 0 positive and rank one, so 0 = 'ifj'ijj* for some '0 £ H. Then, 
given any bo G Bh and e > 0, there is a weak neighborhood M (bo] i)] vT(?7WTe - I {i), bo) I 
such that for every b in the neighborhood, using the triangle inequality repeatedly. 


|$(6) - <F(6o)| = ||(^,&)P - |(V’,^)n < (l(V',^)l + l(^,^o)|) - (^,^o)| (3.46) 

< + l(^,^o)|) (Vl(^,&o)P + £ - l(V',^o)|) 

< (2 |(V',&o)| + V\{'^^bo)\‘^ + e - |(V',&o)|) (vWTMPTe- l(^,^o)|) 


which is equal to e. 

Since hnite rank operators are norm-dense in /C(H), for any 0 G /C(H) and e > 0 there 
is are hnite collections {'ipj} C §h (for §h the unit sphere in H) and {aj} C M such that 

0 - ELi 


op 


< \e. Then, by the triangle equality and the above result, for every b 


in the neighborhood 







\{'iljj,bo)\^ +—e 



(3.47) 


we have |<h(6) — *h(feo)| < £. □ 


Comment Since Bh x Bj ^ Bh^j if neither H nor J are trivial, this construction is neces¬ 
sarily contextual. Therefore, it does not violate Bells’ inequality (see ^6.3| below). Also, for 
H hnite-dimensional, it is possible to use §h instead of Bh and the norm topology (which 
is equivalent to the weak topology in hnite dimensions). In this case, it is also possible to 
further reduce the space by the equivalence relation on §h used in the preceding construction 
for option I. 
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3.6 Embedding classical models into quantum ones 

For cr-algebras generated by a countable collection of atoms, using option I’ it is always 
possible to duplicate classical behavior using quantum nodes; one simply embeds L^(A;/u) 
as the diagonal operators in D(L'^(A; Similarly, using option II, if df is a hnite set 

of points with the discrete topology, C(A) can be embedded as the diagonal operators in 
/C(L^(d:’;counting measure))’*'. 

By 5.28, this is the only way to linearly embed in 'D(L^(A’;/i))’*’ if we want 

to take advantage of the one-to-one correspondence between functions in — 

and the orthogonal projectors Pb € fi)) to the subspace of “functions” 

zero almost everywhere outside B provided by the the map diag* dehned in the comment 
following B5.19. However, if the measure has an inhnitely-divisible part, then unless / G 
is zero on this part, by B4.3 there is no corresponding diagonal operator in 
P(L^(d:’;/i))’*’. One alternative is the map diag”^ into equivalence classes of trace-class 
operators with common diagonals dehned in B5.17 and used in §5.1| below. 


3.7 Classical physics 


Classical mechanics 

Another approach, which is that of classical mechanics, is to not completely duplicate quan¬ 
tum behavior, but instead approximate it. The key is to recognize that L^{T*X]Q)~^ inter¬ 
sects the Hilbert space L‘^{T*X]fl). Similarly, P(L^(A’;/i))’*’ intersects the Hilbert spac^^ 
S2{L‘^{X ] fi)), which is isomorphic to L‘^{X x X]fj,x fi). Therefore, if we have an isomor¬ 
phism T between L‘^(T*X-, O) and L‘^{X x A; /i x /i), we can associate some of the elements 
of L^{T*X] H)’*’ with those in T>{L‘^{X] iT})^. 

For A = and fi mutually absolutely continuous with respect to Lebesgue measure A, 
this is indeed possible, with f{q,p) = given by 


(vrh)’' 


2ip*u\ . ^ dp, ^ dp. X X 

exp 1-y— ) p[q + u,q-u)\ —[q + u)\ —[q + u)d\[u) 




h 


dX 


dX 


(3.48) 


and inverse p{x,x') = (T ^f){x,x') = (7rh)”(\l/*/)(a;,x') given by 



X] 



X 


exp 


ip* [X — X 

h 


dX{p) 


(3.49) 


Intuitively, / G L^{T* X^D)^ will be paired with some p G D{L‘^{X-, p))~^ if p is very close 
to being diagonal and varies slowly along the diagonal whereas / obeys a local version of 
Heisenberg’s uncertainty relation, with sharp features in q being spread out in p and vice 


^^$2 denotes the Hilbert-Schmidt operators. 
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versa. Also, these pairs exist for the ground state of the harmonic oscillators, despite p 
not being very close to being diagonal. Note that when these pairs exist, as a map from 
D{L‘^{A; to L^(T*A’; f2) + , is actually an isometry. 

Two observations arise from this. Firstly, if we believe that a quantum description 
gives better predictions and that a formulation in terms of classical physics is merely an 
approximation, then the classical predictions are completely untrustworthy for non-integrable 
classical systems where / rapidly develops hne tendrils under time evolution. Evolving the 
quantum analogue of the system and then using T would give very different behavior. Either 
the hne tendrils would not form or the correlation would break down, with / = Tp no longer 
being positive. Secondly, classical physics is simpler than quantum mechanics only because 
the time evolution for a Hamiltonian system is so simple, being merely a point transformation 
on a hnite dimensional space, T*A —)■ T*A. Working with full stochastic generality using 
non-Hamiltonian systems in classical physics, the analogous quantum description is no more 
complicated. 

The preceding is evidently dependent on the existence of Lebesgue measure. Since there 
is no inhnite-dimensional analogue of Lebesgue measure [^, it cannot be extended directly 
to held theories. Instead, some measure which itself depends on H must be utilized. 

Distinguishable versus indistinguishable particles 

It is usually said that there are no indistinguishable particles classically because it is possible 
to follow trajectories, which are the time-evolution point transformations T*A —)• T*A. The 
above analysis gives a diherent interpretation that is valid in the more general case where 
there are no trajectories: if indistinguishability is important in a model, so the density matrix 
p is taken symmetric (or anti-symmetric) under interchange, 

p{xi,X2,x[,X2) = ±p{x2,Xi,x[,X2) = ±p{xi, X 2 , X^y ^l) = P{x2, Xi, X^y ^l) (3-50) 

then it is necessarily far from being diagonal, so there is likely no partner in L^{T*A]Q)~^ 
to p and a classical model is invalid. 

A similar situation holds for the case of a-algebras generated by a countable collection 
of atoms. If there is classical indistinguishability (such as dollars in a bank account), the 
measures assigned to certain atoms by any allowed measure must be the same. No issues 
are raised by embedding the measure as the diagonal in p))^. However, trying to 

recast this as quantum indistinguishability then creates off-diagonal entries without classical 
interpretation. 

3.8 Additional structures for Bayesian networks? 

It may be possible that additional mathematical structures besides measures and density 
matrices could be used as the base and target spaces of linear maps to calculate probabilities 
using Bayesian networks. To avoid topological complications, only the hnite-dimensional case 
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will be sought. Looking through the properties of measures and density matrices that are 
essential, we have the following properties required for an additional mathematical structure, 
given by a collection A of hnite-dimensional, normed vector spaces: fijfor any vector spaces 
A, B G the vector space A 0 B is also in A] (ii) each B G ^ has a positive cone, B+ that is 
generatin^^ (in) each B G ^ has the quasi-y4L-property that ||x|| -|- ||y|| = ||x-|-y|| for any 
^,y & B+; and (iv) for any x G A"*", y G B+, x ® y is in (A ® B)”^ with ||x 0 y|| = ||x|| ||y||. 
In addition, (v) it is required that there be some nontrivial (other than the identity) maps 
between these vector spaces that are simultaneously both completely positive and completely 
bounded. 

Evidently, for measures ^ is a collection of n-dimensional vector spaces for n a natural 
number, {1,2,3,...}. For density matrices it is a collection of n-dimensional vector spaces 
for n a square number, (1,4,9,...}. The question, which is open, is whether there are a 
collection of vector spaces A satisfying the preceding properties other than spaces isomorphic 
to measures, density matrices, or various tensor products of measures and density matrices. 
Obviously, this question is of vital interest-if the answer is negative, then we are limited to 
linear maps on measures, density matrices, and their tensor products in formulating Bayesian 
networks; if the answer is affirmative, then there is the immediate additional question of 
whether or not the additional structure is of practical utility in making predictions. 


^^Any X in each B G A can be written as y — z for some y, z S B. 
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Chapter 4 

Constructing the networks 


4.1 Graphical constructs and rules 

Recall the graphical model simply gives the form for the needed information and illustrates 
constraints. The actual needed data is given in an accompanying table. In the following, 
hrst the rules for the graphical model are given, then what accompanying data needs to be 
supplied depending on which option, I’ or II, is chosen. 

Nodes and terminators 

Nodes that are observable, with the traditional emphasis on the transition probability func¬ 
tion, will be indicated in black. Hidden classical nodes, where the linear map on measures is 
taken to be fundamental, will be red. Quantum nodes, whose linear maps involve non-trivial 
spaces of density matrices, will be blue. Arrows share the color of their parent. Observable 
nodes that are being incorporated into the calculation of probability will be crossed. Those 
that are being marginalized will be left open. Those that are being conditioned on will be 
hlled. Hidden (classical or quantum) nodes will always be left open. To make it visually 
clear whether or not a graph is complete and represents the calculation of a number (the 
probability) or is just a graph fragment with unresolved maps, terminal nodes which are 
either maps from the trivial space (giving initialization) or to the trivial space (giving the 
evaluation map) will be indicated by half-circle-half-box shapes, with the half-box being the 
terminating end. 


*□ 


D 


(4.1) 
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Splitters 


As has been pointed ont (see ^2.4), the usnal Bayesian network constrnction depends heavily 


on the ability to copy, with each child node receiving a copy. Withont this ability (see 13.2 


and 13.4), an additional graphical structnre is reqnired, the splitter. This is an oval with 
one or more arrows coming in and more than one arrow coming ont-if only one arrow comes 
ont, the splitter can be dispensed with and all the incoming arrows can be redirected to the 
following node. 



(4.2) 


Pince-nez 

There is a certain constrnction that occurs so often for joining hidden nodes to observable 
ones that it is replaced by its own structure, the pince-nez. As the name suggests, this is 
two circles, one an observable node, the other a hidden one, joined by a bar. 



The neutral term pince-nez is employed since the concept of “measurement’’^ applied to 
hidden nodes is so fraught with metaphysical connotations. 


4.2 Data for the structures using Option I’ 

Using option I’, for each hidden node, the most general data is 

(X,/i, J;L) 

where X is the input set, /i the accompanying base measure, H the input Hilbert space, 
O the output set, z/ the accompanying base measure, J the output Hilbert space, and L a 
completely-positive, norm-preserving (on the positive cone) map in i3(X^(X; /r; X>(H)), X^((9; 
u] X’(J))). If either Hilbert spaces are the trivial space C, it can be left off the list. Similarly, 
if either base measure is trivial, for instance, if /i(0) = 0, /x(X) = 1 with no other sets 
measurable, then it and its set can be left out as well. 


^We will always place “measurement” in quotation marks to emphasize its problematic nature. 













CHAPTER 4. CONSTRUCTING THE NETWORKS 


42 


For each splitter, the most general data is 

(X, /i, H, Oi, Ji, . . . , OfYi, Vmi -Im.) 

where X = C>i x ■ ■ ■ x Om-, /i = z^i x • • • x z/^, and H = Ji ® ® with dz giving 

the isomorphisms, which should be trivial except for possible permutations to make sure 
everything goes to the correct place, with more complicated behavior placed in separate 
nodes. Once again, trivial entries can be left out. 

For each pince-nez, the most general data is 

(X,/i,H;7^,r;C>,^/,J;X) 

where X is the input set, fi the accompanying base measure, H the input Hilbert space, IZ 
the observable set, r the accompanying base measure, O the output set, z/ the accompanying 
base measure, J the output Hilbert space, and L a completely-positive, norm-preserving (on 
the positive cone) map in H (X^(X;/i; X(H)), X^(7?. x 0 ]t x z/;X(J))). Again, trivial entries 
can be left out (however, if r is trivial, then the pince-nez is actually just a node). 

4.3 Data for the structures using Option II 

Using option II, for each hidden node, the most general data is 

(X,H;0,J;X) 

where X is the input compact space, H the input Hilbert space, O the output compact 
space, J the output Hilbert space, and L a completely-positive, norm-preserving (on the 
positive cone) map in H (C(C>;/C(J)), C(X;/C(H))). As previously noted, the spaces can be 
locally-compact if L is further restricted. Trivial entries can be left off the list. 

For each splitter, the most general data is 

(X,H;C>i,Ji,...,C>^,J^;T) 

where X = C>i x • • • x Om with the product topologies agreeing as well, and H = Ji 0 • • • 0 J^, 
with d/ giving the isomorphisms, which should be trivial except for possible permutations to 
make sure everything goes to the correct place, with more complicated behavior placed in 
separate nodes. Once again, trivial entries can be left out. 

For each pince-nez, the most general data is 

(X,H;7^;0,J;X) 

where X is the input compact space, H the input Hilbert space, TZ the observable compact 
space, O the output compact space, J the output Hilbert space, and L a completely-positive, 
norm-preserving (on the positive cone) map in B{C{1Z x C>;/C(J)), C(X;/C(H))). As previ¬ 
ously noted (see 1 |3.2[ ), the spaces can be locally-compact if L is further restricted. Again, 
trivial entries can be left out (however, if TZ is trivial in the sense of being the one-point set, 
then the pince-nez is actually just a node). 
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4.4 Example-double-slit experiment 


So far, the exposition has been quite abstract. To make things more concrete and demon¬ 
strate how to use Bayesian networks with linear maps (both classical and quantum) in 
practice, consider the following example. There is a black-box with a dial which can be set 
to any position 0 on a circle, a switch with two settings, and four lights, with lights one and 
two forming one pair and lights three and four forming another. After study, it is determined 
to have the following properties: if the switch is off, periodically one of the second pair of 
lights flashes while the hrst pair never flashes, whereas, if the switch is on, one of each pair 
flashes. After more study, it is determined the behavior of each round is independent and, 
if the switch is off, with probability cos^ 6 the third light alone flashes and with probability 
sin^ 9 the fourth light alone flashes. If the switch is on, the joint probabilities are as follows: 

hrst on, third on, others off I 

hrst on, fourth on, others oh I ,, ,. 

f (4.4) 

second on, third on, others oh | 

second on, fourth on, others oh | 


with the other probabilities being zero. Presenting the information in the manner of the 
preceding joint probabilities corresponds to the following graphical model: 


dial 


switch 



(4.5) 


Suppose the box is now opened and is found to be composed of hve modules. A cable 
runs from the hrst module to the second. Two cables come from the second module, one 
each running to a pair of seemingly identical modules, each of which has one of the hrst pair 
of lights. The switch activates switches on the pair in unison. These modules have cables 
running to the last, which has the dial as a control. If possible, given the joint probabilities, 
we should conhne ourselves to models which are consistent with the constraints implied by 
these observations because we would like to be able to predict what would happen if these 
modules were rewired or taken out and placed in a diherent context. 
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4.5 Quantum model 


Consider first a quantum model with graphical model: 



The blue 1,2,3,4,5,6 are for reference. The information presented in the form indicated by 
the graphical model is as follows: 


node 1 (; C^; Li), where Li is the constant map with value 

node 2 (C^;C"^;L 2 ), where L 2 {p) is 


1 




■ 1 1 ■ 

V2 


1 1 


2 2 

1 


x/2 x/2 


1 1 





.2 2 . 


0 0 0 0 
0 0 
0^0 
0 0 0 0 


(4.7) 


splitters (C'^;C^,C^) 


pince-nez 4 


( 

( switch on, 1 counting ^2 ^ 
1 switch off j ’ measure ’ ’ 

light one on and switch on, ' 
light one on and switch off. 

^ counting . ^ 2 . ^ 

’ measure ’ ’ ^ 

light one off and switch on. 


light one off and switch off. 

y 


(4.8) 
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where L 4 ^{t) is 


I switch off 

0 


1 

0 


1 

0 

0 

0 

^ 1 switch on 

0 

0 

0 

0 


0 

0 

0 

1 

^ 1 switch on 

0 

1 


if light one off and switch off 
if light one on and switch off 

if hrst on and switch on 

if first off and switch on 


45 


(4.9) 


pince-nez 5 Same as for pince-nez 4 except the second light replaces the hrst light, 
pince-nez 6 



third and fourth on. 


\ 

Lebesgue , _ 
measure 

third on, 
fourth on. 

counting 
’ measure ’ 

; -he 

V 

third and fourth off 


/ 


where Lq{t) is 


r*27r 


■ 1 0 0 0 ] T{e) 


1 

0 

0 

0 


d9 if both third and fourth on 


f*2n ^ 


0 0 


t{9) 


0 

J_ -40 

0 


d9 if third on 


r*27r ^ 


0 0 


t{9) 


0 

_ Tpid 

72® 

0 


d9 if fourth on 


r*27r 


'0 0 0 1 ] t{9) 


0 

0 

0 

1 


d9 if both third and fourth off 


(4.10) 


Using this information, the joint probability, given that the switch is off and the dial is set 
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to 9, for both the first pair being off, the third light being on, and the fourth light being off, 
is then given by, for positive, unit-norm g G L^([0, 27r), Lebesgue), 


Le {L 4 ® -hs) {L 2 (-hi) Iswitch off) I first pair is off, 9 


pair : 
switch is off 


third is on, 
fourth is off 


r*27r 


0 A"*'' 0 


0 

1 

0 


r-27r 


g{e')de'= / cos^ 9'g{e') de' 


(4.11) 


(4.12) 


For g sufficiently peaked about 9, the result is approximately cos^ 9. 

Similarly, the joint probability, given that the switch is on and the dial is set to 9, for 
both the first and the third light being on, with the second and fourth being off, is then 
given by 


/ 




(F4 -hs) (-^2 (-hi) Iswitch on) | first 


V 


is on, 9 
second is off 
switch is on J 


(4.13) 


third is on, 
fourth is off 


-27r 


^ AW a 


0 

1 

F2 

0 

0 


9{9') d9' = - 


(4.14) 


The other joint probabilities can be calculated similarly and agree with the joint probabilities 
originally determined for the black box. 


Comments on the use of option I’ 

Note the use of option I’ leads to extra complexity concerning the dial position 0, forcing 
the introduction of a function g peaked at 0, although in this example we are not concerned 
about modelling the uncertainty of that value. In this case, using option II would have 
been slightly simpler. Then, instead of using counting measure as the base measure on the 
discrete sets, the discrete topology would be used to make the discrete sets into compact 
spaces. 
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4.6 The Bayesian network approach versus the 
standard textbook approach 

Interpreting the switch as controlling the operation of a position “measurement” at two 
slits and the dial as selecting a point on a backing screen for another position “measure¬ 
ment”, there is a well-known standard textbook approach using wavefunctions, projectors, 
and Bohm’s postulate that replicates the outcome. By comparison, the Bayesian network 
for this particular problem may appear cumbersome; however, that is largely a result of 
familiarity with the hrmer. With the network approach, there are none of the seemingly ad 
hoc rules for dealing with quantum systems; instead, only the simple conditions of positivity, 
linearity and potential universality. Also, the graphical model is highly intuitive and guides 


the writing of the correct expression (4.13). This is of great importance in dealing with more 
complicated systems. 

In addition, as will be illustrated by more complicated examples to follow (see §??), the 
network approach is far more flexible. Of particular interest, in some special cases it allows 
the needed Hilbert spaces to be kept to reasonable sizes in the course of the calculation 
instead of ballooning exponentially. To be more explicit, for a Hilbert space of dimension n, 
the space of operators has dimension of order rP, and the space of maps on these operators 
has dimension of order which is why it appears cumbersome. However, if, by utilizing 
the flexibility of the network approach, one is able to avoid dealing with Hilbert spaces of 
dimension and operator spaces of dimension where N is the number of particles, 
the potential savings is tremendous (as will be seen in §??). 


4.7 Classical hidden model 

For the same black box, now consider a classical hidden model with the same graphical model 
(gg I except now the hidden nodes and connecting arrows will be red). For simplicity, 
functions on discrete spaces indexed by numbers will be given as column vectors. The 
information presented in the form indicated by the graphical model is as follows: 


node 1 ( ;|1,2}, -r] where Li is the constant map with value 

' ^ measure ' 


2 

1 

L 2 


node 2 ( {1,2}, ; {(1,1), (1,2),..., (4,4)}, ;L, |, where LM is 


[000000 Pi 00000 P2 000 ]' 


(4.15) 


splitter 3 ({(1,1), (1,2),..., (4,4)}; {1, 2, 3,4}, {1,2, 3,4}) with the obvious identihcation. 
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pince-nez 4 


switch on, 
switch off 


X {1,2, 3,4}, 


counting 
measure ’ 


{ light one on and switch on, 
light one on and switch off, 
light one off and switch on, 
light one off and switch off. 


\ 

counting r. ^ counting 

, |i, Z, O, 4j-, , 474 

measure measure 

/ 


(4.16) 


where 4^4(r) is 


I switch off 

0 

" 0 0 0 0 
110 0 
0 0 0 0 

_ 0 0 0 0 

" 0 0 0 0 

0 0 0 0 

0 0 0 0 

0 0 11 


if light one off and switch off 
if light one on and switch off 


'TI switch on if first ou and switch on 


'T I switch on if first off and switch on 


(4.17) 


pince-nez 5 


Same as for pince-nez 4 except the second light replaces the first light. 


pince-nez 6 


'{(l,l),(l,2),,..,(4,4)}x|0,2^), X 

measure measure 


(4.18) 


third and fourth on, 
third on, 
fourth on, 

third and fourth off 


counting 

) ) -^6 

measure 


where Lq{t) is 


r*27r 


1111110000000000 ] t( 0 ) de 


(4.19) 


if both third and fourth lights are on. 


r-27r 


■ 0 0 0 0 0 0 cos2 0 1 0 0 0 0 cos^ 0 1 0 0 ] r(0) dO (4.20) 
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if the third light is on and the fourth is off, 


f27r 


[ 0 0 0 0 0 0 1 0 0 0 0 1 0 0]r(»)<i« 


(4.21) 


if the fourth light is on and the third is off, and 


l^2n 


[0000000011110011] T { e ) de 


(4.22) 


if both third and fourth lights are off. 

Using this information, the joint probability, given that the switch is off and the dial is set 
to 6, for both th e first pair being off, the third light being on, and the fourth light being off, 
is then given by (4.1l|), which is cos^ 9' g{9') dO' ~ cos^ 9. Similarly, the joint probability, 
given that the switch is on and the dial is set to 9, for both the fi rst and the third light being 
on, with the second and fourth being off, is then given by (|4.13), which is \g{9') d9' = \. 
The other joint probabilities can be calculated similarly and agree with the joint probabilities 
originally given for the black box. 


4.8 What is a quantum system? 

For the two-slit experiment, which only required hnite-dimensional linear algebra in the 
quantum model (apart from the already commented on problem of inputting the dial setting 
0), a classical model with the same behavior also only required hnite-dimensional linear 
algebra. This is atypical. As we have seen in §3.5[ a classical model that duplicates the 
behavior of a quantum model is generally far more complicated. Also, these classical models 
are generally inherently contextual. 

A reason why the name “quantum system” could still be applied to this example is 
universality: while both models have been constructed to be potentially universal, testing 
will reveal if they fail in this regard. If the black boxes were opened to reveal a laser, beam¬ 
splitters, photon detectors, and so on, from experience we would have a lot of conhdence 
that the modules in the quantum model are universal, whereas we would have very little 
conhdence for the modules in the classic model to have that property. Conversely, if the 
boxes were opened to reveal regular computer circuits, the situation would be reversed. 
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Chapter 5 

Relation to textbook quantum 
mechanics 

5.1 Textbook rules for quantum mechanics 

The quantum Bayesian network structure developed so far has constraints only arising from 
the requirements of positivity, linearity, and potential universality. It is not obvious that it 
has any connection necessarily to what is usually thought of as quantum mechanics. Using 
option I’, that is not the situation, as is shown below. 

However, while the following justihes the usage of the name quantum for the extended 
Bayesian networks, it should not be taken as a justification of them-quite the opposite. The 
Bayesian network approach is predicated on a very reasonable basis. The textbook approach 
to quantum mechanics is only comfortable due to familiarity; on its own merits its rules are 
incomprehensible and unmotivated. Hence, the following is better taken as a justihcation 
of the textbook approach. In other words, if quantum behavior had hrst been discovered 
as forecasting arising from a reasonable extension of Bayesian networks, and the usual list 
of textbook rules were later discovered picking out a certain subset of networks that were 
sufficient to model any situation, no one would take those rules as primary. 

Rule one 

Textbook quantum mechanics imposes an additional rule: only networks in the form of 
chains are permitted and all the Hilbert spaces along a chain must be the same (apart for 
terminating trivial spaces). (This rule may arise from imagining that we have some sort 
of universal, initial-value, dynamical system). Mathematically, this rule puts no additional 
constraints on the network formulation; it is always possible to obey this by working in a 
sufficiently large Hilbert space and waiting until the end of the calculation to perform the 
reduced traces arising from terminal nodes. However, doing this in practice is unnecessarily 
difficult-the whole point of Bayesian networks is to try to hnd a model made of simple and 
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(hopefully) universal parts where the calculations can be done in a manageable manner. 
Therefore, similarly our treatment of determinism in 12.6, we will discard this rule. 


Rule two 

Another rule from textbook quantum mechanics is: the linear map for a quantum node can 
only be of the form Lp = UpU*, where U is unitary. As we prove in theorem 5.1.1 below, 
this is no limitation at all mathematically if option I’ is employed. If option II is used, 
the theorem still holds if H is separable and if there is some strictly positiv^ hnite Radon 
measure on X. For any node with data (X, p, H; J;X), the map L can be represented 
as the sequence of three operations, using some auxiliary Hilbert space M: (i) injecting 
p G X^(X; p] X’(H)) as a density matrix on a Hilbert space K (g) L^{0; v)® I utilizing a partial 
isometry; (ii) taking a reduced trace of the density matrix over K; (iii) mapping the resulting 
density matrix on i/) gJ into L^{0] z/; X’(J)). By introducing an auxiliary Hilbert space 

L with 

L\X;p) ®H®1 = L^{0;iy)® I® M ( 5 . 1 ) 

and some hxed template density-matrix on L, the partial isometry can be upgraded to an 
unitary operator. In terms of the graphical model, this means replacing a node with the 
graph fragment: 



( 5 . 2 ) 


Of course, just because we can do this does not mean we must, or that we should. To obey 
this rule, a great deal of extra computation and irrelevant, arbitrary choices must be made 
to no practical beneht. Therefore, this rule will also be discarded. 


Rule three 

Yet another rule from textbook quantum mechanics is: the linear map for a terminal pince- 
nez (a destructive “measurement”) can only be of the form J^Lpdu = ti Esp, where 
{Eb}b&£ are a complete set of mutually commuting, orthogonal projector^ For the other 

^Every open set gets measure greater than zero. 

is a complete set of mutually commuting, orthogonal projectors for a tr-algebra £ of subsets of a 
set X if they are all mutually commuting, orthogonal projectors with E^s = 0, Ex = I, and E\j, ^Bj 

for all countable, disjoint collections {Bj} C £, with convergence of the sum in the weak* topology induced 
on ;B (H) by its being dual to the trace-class operators (H). In this case, convergence in the ultrastrong- 
operator, strong-operator, ultraweak-operator (same as weak*), and weak-operator topologies on /B(H) are 
all equivalent (see A2.1). 
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pince-nez, only a-algebras generated by a countable set of atoms are allowed, and the map 
must be of the form 


Lpdu = E EAjPEAj for {Aj} a partition of B 


' B 


j 


(5.3) 


This rule can also be stated in the form that every “measurement” has an associated op¬ 
erator, which are the coordinates r (taking values as coordinates in the discrete case) for 
TZ acting on in this formulation. An even stronger formulation requires the result 

of any “measurement” to be an eigenvalue of the associated operator 19 . This implicitly 
assumes any results accrued to unions of atoms is due to post-“measurement” garbling and 
not inherent in the “measurement”. Furthermore, if the eigenspace corresponding to any 
eigenvalue has dimension greater than one, this formulation fails to distinguish among the 
myriad of possible complete sets of mutually commuting, orthogonal projectors consistent 
with its prescription, giving rise to the “three-box paradox” if there is post-conditioning. 

Again employing option I’, as we prove in theorem 5.1.1 below, this third textbook rule 
also imposes no mathematical limitation. Using option II, the theorem still holds with the 
same limitations given before. For any pince-nez with data (X, p,H]IZ, r; (9, z/, J; L), the map 
L can be represented as the sequence of four operations, using some auxiliary Hilbert space M; 
(i) injecting p G X^(X; /i;X(H)) as a density matrix on a Hilbert space L‘^{TZxO] rxz/)(8)J(8)M 
utilizing a partial isometry; (ii) making a projective “measurement” on the r) portion; 

(iii) taking a reduced trace of the density matrix over L; and (iv) mapping the resulting 
density matrix on ® J into L}{0]V]'D{T)). By introducing an auxiliary Hilbert 

space L with 

X^(A;/i) 0 H 0 L = L^{TZ x 0 ]t x u) ® I ® M (5.4) 

and some hxed template density-matrix on L, the partial isometry can be upgraded to an 
unitary operator. In terms of the graphical model, this means replacing a pince-nez with the 
graph fragment: 



(5.5) 

Once again, just because we can do this does not mean we must, or that we should. To obey 
this rule, a great deal of extra computation and irrelevant, arbitrary choices must be made 
to no beneht. Therefore, this rule will also be discarded. 


The main theorem 


The statement of the aforementioned theorem is: 
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Theorem 5.1.1 Any completely-positive map L G CP(L^(X; p; 11(H)), L^(7?. x 0 -,t x 
n;'Zl(J))) that is an isometry on the positive cone can be expressed as 

Lp = (diag^2(7j.^) (g) (diagi 2 (o.^) (g) /^(j)) 0 tr M)^((diagi2(j.^) 0 Ii3{h))~^p)V* 

for some Hilbert space M and some partial isometry V G (I; p) 0H, iJZ x 0 ]t x p)® 
J0M). If all the Hilbert spaces (X; /i), H, X^ (7^; n), X^ (O; n), and J have hnite dimension, 
then M can be taken to be J 0 dimX^ (X; p) 0 H. 

Proof Apply B5.27. □ 


Comments The full proof is quite lengthy, occupying from B5.17 to B5.27 in the ap¬ 
pendix. The following is an outline: First a diagonal extraction map diag is dehned in 
B5.17. Using this, we hnd that the dual space to density-matrix-valued, X^-“functions” 
is a von Neumann algebra]^ Then one theorem by Stinespring [118 on completely-positive 
maps on C^-algebras and another by Sakai 109 on representations of von Neumann algebras 
complete the proof. 

For the inverse map in the statement of the theorem, (diag^2(j.^)0/B(H))~^, any particular 
choice in the preimage can be taken-the hnal result is the same regardless of which choice is 
made. Also, as claimed, introducing an auxiliary Hilbert space L satisfying (5.4), the partial 
isometry can be certainly be upgraded (although not in a unique fashion) to an unitary 
operator U satisfying, for some hxed template density-matrix y G X(L), 


U((diagi 2 (x.^) 0 /b(h)) ^p)V* = U(((diag^ 2 (x.^) 0 Ib{h)) V) ® x)U* (5.6) 

for all p G X^(X; p; X>(H)). In addition, the statement of the theorem does indeed indicate 
a projective “measurement”. Let B be any r-measurable subset of 77. Then, for any p G 
V{L‘^{IZ]t))^ 1b as an element of = X^(77;r)* acting on diag^ 2 ( 7 ^.,-)V^ is 

/ diagi 2 ( 7 e.^)(p dr = tr PbP (5.7) 

J B 

where Pb = diag^ 2 ( 7 ^..,-)ls is the orthogonal projector to the subspace of “functions” in 
X^(77;r) that are zero almost everywhere outside B. 


5.2 Weak measurements 


Reproducibility as a criteria 


One motivation for the restriction to projective maps, emphasized by Dirac 20 , is repro 


ducibility: if two successive pince-nez have the same data (and the incoming and outgoing 


Banach algebra is a Banach space equipped with an associative, distributive product satisfying |ja6|| < 
||a||||6||. A *-algebra is a Banach algebra with an antilinear involution. A G*-algebra is a *-algebra where 
||a*a|| = ||a|p. A von Neumann algebra (also termed a W*-algebra) is a X*-algebra which, as a Banach 
space, is a dual space. 
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data are compatible), then conditioning on the observation for the hrst being in some mea¬ 
surable subset B <zTZ, the probability measure for the second observation has the property 
that on measurable sets that do not intersect B, it gives zero, whereas on measurable sets 
that contain B, it gives one. The Bayesian network for this set-up is: 



1 1 (5.8) 

where the repeated ‘I’s indicate the data for the two pince-nez are the same. 

While this criteria looks superhcially attractive-as is implied in the saw “measure twice, 
cut once”-its flaw is readily apparent. Suppose I were asked to see whether or not a light 
in another room were off. If I see the light is off, I return and report it being off. If I see 
it is on, I turn it off, then return and also report it being off. This is repeatable, but does 
not correspond to what we generally mean by the measurement of the state of the light. 
While this example may seem obtuse, the point is that mathematically projective maps in 
quantum mechanics act in this manner-they have a signihcant effect even if the results are 
ignored. 

Weak versus strong maps for pince-nez 

Building from the preceding example, if the incoming and outgoing data for a pince-nez are 
compatible, then the map L is termed strong if, when marginalizing over the observation, the 
result (using option I’), J^L(p) dr, is far from p for all but a small subset of p’s. Conversely, 
the map L is termed weak if L(p) dr is close to p for a large subset of p’s. Obviously, there 
is a continuum of possibilities; the strength could be varied depending on an input parameter. 
For instance, in the double-slit example previously considered, the switch control could be 
replaced by a continuous slider that would slowly vary the joint probabilities of the output. 

Now we can easily understand the misconception leading to the repeatability criteria; in 
the classic case, restriction maps (using diagonal projectors) are both repeatable and the 
weakest possible, having no effect if their observations are ignored. In the quantum case, 
however, while projective maps (using orthogonal projectors) are still repeatable, they can 
also be very strong. 


Information as a criteria 


Another misconception is that the result of any pince-nez map can be duplicated by a 
projective map followed by a garbling of the observation data[^ If this were the case, then, 
by convexity, the informatioij^ using a projective map would always be higher than for any 


20 


^For instance, Dirac seemed to believe “measureme nts” w ere always inherently projective. 

^Information in the sense of the Shannon definition 115 for the probability measure on the observation 
set TZ. 
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other map in every situation. Consequently, the claim by Kochen and Speaker that projective 
“measurements” 
this is false. 


give maximal knowledge 40 would be correct. However, it is easy to see 
For instance, the matrices 



r ^ 01 


■ 1 1 ■ 


■ 1 

1 ■ 

01 = 

3 y 

[0 IJ 

5 02 — 

! ! 
.5 3. 

CO 

5 

3 . 


are all positive, sum to the identity, and do not mutually commute. The map L : V(C^) —)■ 
given by L(p) = (tr 0ip, tr 02P, tr </> 3 P) is completely-positive and norm-preserving (on the 
positive cone), yet cannot be duplicated by any garbling of a projective map, Y^k=i 9jkH Ekp, 
for stochastic matrix G = [gjk] and commuting, complete, orthogonal projectors {Ei,E 2 } 
on since the matrices {guEi + gi 2 E 2 , 5 ' 21 -El -|- 5 ' 22 -E 2 , 5 ' 31 -El -|- 5 ' 32 -E 2 } necessarily mutually 
commute. 

Also, even if it were possible to duplicate the behavior of a particular pince-nez map by 
an projective map followed by a garbling of the observation data when the module is used in 
isolation, if the module is then inserted into a larger Bayesian network where conditioning 
is taking place, the difference in behavior between the particular pince-nez map and the 
projective map followed by a garbling may become quite large. In particular, for so-called 
“weak-measurements” , which employ both strong and weak pince-nez maps together with 
both pre- and post-conditioning, the information can be very large. The typical Bayesian 
network for one of these has the form: 



(5.10) 


There is both pre- and post-conditioning; those pince-nez have projective maps. The central 
pince-nez has a map that uses “fuzzy” projectors with spread of order y/N for N particles. 
Hence, the relative standard deviation in the values assigned to the observation set is only of 
order so the information is high; furthermore, the center of the peak of the distribution 
can be in unexpected ranges of the observation set, as has been well-publicized recently 13 
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5.3 Comparison of Bayesian networks to quantum 
circuits and tensor networks 


A commonly-used, alternative graphical approach for quantum systems is quantum cir¬ 
cuits 51 . For example consider the following quantum circuit: 



L 




[ 1 



J 





V’i V V’/ (5.11) 


Then for Hilbert spaces H, J, and K, termed the quantum channels, and initial state V’i ^ 
H® J®K, the hnal state is given by (Jh®H)( f/®/K)t/’i for some unitary U : H® J —)■ H® J 
and V :J®K—)-J®K. Quantum circuits are obviously directly related to the usual 
textbook quantum mechanics working with wavefunctions. Following the hrst textbook rule 
from above, there is a single overall Hilbert space, the tensor product of all the Hilbert 
spaces for each quantum channel, that is used throughout. Only unitary operators can be 
accommodated, following the second textbook rule above. 

It is possible to incorporate projective “measurements” into the quantum circuit, giving 
a tensor network, but this requires doubling the graph-clearly not an efficient approach 
graphically. For instance, suppose the initial state is a product state, 'ipi = 'ipi ® 'ip 2 C) 'ips 
and that a projective “measurement” is made on the hnal state, where the projector is of 
the form Jh ® P. Then the calculation of the probability ||/h ® P'lpfW'^ = {Ih ® P'ipf,'ipf) is 
represented by the tensor network 


V'l O- 
^>2 O - 

-03 O 


u 


V 


p 


- ori 

~ -U -OV’2 

u * 

J-^- ors 

V* 


(5.12) 


The tensor network has some of the advantages of the Baysian network in that once it is set 
up, it is possible to look for computational shortcuts. 

On the other hand, the Bayesian network has many advantages. It does not depend 
on the incomprehensible and unmotivated textbook rules, but instead stands on its own 
reasonable basis. It is graphically more efficient in not having to duplicate itself to include 
observations. It is also potentially more efficient in allowing non-unitary nodes and non- 
projective pince-nez, hence avoiding the need to introduce auxiliary spaces, and in having the 
splitter construction, so it is not necessary to maintain all the quantum circuits throughout 
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the diagram. The Bayesian network indicates which observable nodes are being marginalized 
or conditioned on and works seamlessly with the usnal observable Bayesian networks that 
are already in common nse, so it easily allows models with information coming from previons 
observations or random factors (snch as coin flips). For instance, a tensor network coming 
from a qnantnm circnit for the “weak measnrement” example (5.10) wonld be far more 
complicated, with the need for anxiliary spaces, yet wonld still not be able to indicate the 
post-conditioning graphically (the pre-conditioning conld be incorporated into the initial 
valnes for the wavefnnction). 
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Chapter 6 

Further examples 


6.1 No-cloning—classical and quantum 

No-cloning is taken as a hallmark of quantum mechanics. However, it is a property of 
Bayesian networks more generally, holding even if all the Hilbert spaces are trivial. The 
non-linear map Clone sends the density matrix-valued vector measure p io p ® p. This 
cannot be implemented by any device that can be modelled by a Bayesian network since 
such a network, no matter how complicated, altogether gives rise to a linear map on p by 
the principle of linearity. 

One may still consider something a cloning device if it does not always clone, but only 
clones conditioned on a observation; for instance, suppose there is a green light for success 
and a red light for failure. However, to be a cloning device, it must have a hnite probability 
of success for at least two distinct inputs, p and z/. By the principle of linearity, then every 
convex combination tp + {l — t)v of the two has a hnite probability for success. Now consider 
the following graphical model: 


D 


I 0 light 

J cloning 
I device 


testing 

device 


( 6 . 1 ) 


For the device to be cloning, then conditioning on the light being green, the joint probability 
for the observations on the testing device must be the same as those for the graphical model 
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with the Clone map 


D 


Clone 


testing 

device 


L 


( 6 . 2 ) 


with the same corresponding data for the testing device for each model. For every possibility 
of what is put in the dashed boxes and what the corresponding information is, there are some 
linear maps L from 'D(H)-valued vector measures on A to I^(H (8) H)-valued vector measures 


on AxX and K, M from the latter space to [0,1] such that the joint probability (conditioning 



probability using the mathematical Clone map is iCClone(p) = K{p x p). With the above 
assumption, (M o L){tp + (1 — t)i>)) ^ 0; however, then we must have 

(M o L){tp + (1 — t)v)K{{tp + (1 — t)v) X {tp + (1 — t)Tx)) = {K o L){tp + (1 — t)u) (6.3) 

for all t G [0,1], which is impossible. 

6.2 Teleportation-classical and quantum 

Teleportation is also generally taken as a hallmark of quantum mechanics. However, as will 
be shown, teleportation is also possible in classical hidden models. 

A device is a teleportation device if, when modelled by a Bayesian network, for a hxed 
set X, base measure p, Hilbert space H, and template density-matrix-valued function a G 
X X]px p]V{H ® H)+), it takes the product function p ® a E L^{X x X x X] p x 
p X p; V{H O H 0 H)+) back to p for any p G L^{X] /r;P(H)+) and where a splitter is used 
so only Alice gets the (A,/!, H) that p lives on. Bob gets the hnal p, and there are no 
hidden connections between Alice and Bob, but only classic, observable information. Note 
that because the first textbook rule that the overall Hilbert space is the same throughout is 
unnecessary for the Bayesian network formulation, there is no need to have another output 
from Alice (whose value is irrelevant in the context of being a teleportation device)-this 
leads to simplihcation in the required calculations. 

There is a Bayesian network for such a device using quantum nodes (based on, but 
extending, the calculations of Bennett, Brassard, Crepeau, Josza, Peres, and Wootters 0 ) 
ii X = {1, 2,... ,m}, p is the counting measure, H is C"’, and a is the density-matrix-valued 

function (written as a column vector with m? entries, each a. rP x u^-matrix) — Vec Im ® 


m 


- Vec R ( Vec IrA 


n J 


Note a is the Kronecker product of Copy applied to the uniform 
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distribution with the usual maximally-entangled state. The graphical model is: 


r — — — — — — — — — — ~l 



The double arrow indicates Bob’s observations are identical to Alice’s (perfect communica¬ 
tion). The information presented in the form according to the graphical model is: 


node 1 



counting ^ 
measure ’ ’ ^ 


where Li is the constant map with value p. 


node 2 {1, 2,..., m} x {1,2,..., m}, 

map with value a. 


counting pn (g, j where L 2 is the constant 

measure / 


splitter 3 



counting _ 

measure ’ ’ 


(1,2,. ..,m} X (1,2,.. .,m}. 


counting 

measure 


,C”®CMl,2,...,m}, 


counting 
measure ’ 


pince-nez 4 


1^(1, 2, ... ,m} X (1,2,... ,m} 


counting 

measure 


{l,2,...,m} X (1,2, ...,n} X {l,2,...,n}. 


counting 

measure 



Let n be the n x n-matrix with the nth roots of unity along its diagonal, S be the n x n- 
shift-matrix with a one in the upper-right corner and ones on the subdiagonal, and Q be 
the m X m-shift-matrix with a one in the upper-right corner and ones on the subdiagonal. 
For fc, / G (1,2,... ,n}, let Wki be the column vector (J„ ® Vec R- Then for 

(j, k, /) G (1, 2,..., m} X (1, 2,..., n} X (1, 2,..., n}, LRRjki is given by 

( Vec ((1^2 O wIi)t{I ^2 ® wu)) 


(6.5) 
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node 5 


X X {1,2, X {l,2,...,n}. 


counting 
measure ’ 


(1,2,... ,m}. 


counting ^ 
measure, ’ ^ 


The incoming density-matrix-valued function r will be considered as a m x n x n-array, 
indexed by j, k, I, of column vectors of m entries each, where each entry is a n x n-matrix. 
With this convention, L^{t) is given by 


® In) [{Im ® ® ')) (6.6) 

j=l k,l=l 


Then, for any following testing device, the incoming density-matrix-valued function from 
Bob if Alice observes (j, k, /) G (1, 2,..., m} x (1, 2,..., n} x (1, 2,..., n} is 

L 5 ® Il3(LHX-,fM-,H)))){Ll ® L 2 ))\j f,i l{(i,fc,z)}) (6-7) 

This apparently complicated expression is just p, regardless of which particular values of 
j, k, I were observed by Alice and sent to Bob. 

The classical case 

Note the preceding is meaningfnl for the case of trivial Hilbert space, n = 1, so teleportation 
is not a quantum phenomenon. Another way to achieve the same ontput in this purely- 
classical case is to not have a template shared by Alice and Bob, but rather to have the 
pince-nez for Alice have simply the identity for its map (so the hidden node is not really 
hidden, bnt is actnally observable). Then Alice sends her information to Bob, who makes 
a copy using the information. Finally, Alice and Bob forget what the information was (so 
their nodes are marginalized). The result is that the incoming probability distribution to 
Alice is the same as that ontgoing from Bob. 

However, this second approach is not teleportation because of the forgetting step. Also, 
for the classical case of the teleportation given above, an eavesdropper to the information 
sent from Alice to Bob withont access to the template cannot replicate the probability 
distribution, whereas in the second, non-teleportation approach, the eavesdropper conld not 
only replicate the probability distribution by making his own copy, then forgetting, but by 
not forgetting, would actually have more information. 

Does a teleportation device really teleport? 

Note the graphical model for the teleportation device is simply a graph fragment since the 
output from Bob is not terminated. One may hope its behavior is universal, but is always 
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possible that additional testing will reveal it is not. For instance, before the discovery 
of quantum mechanics, it may have been believed that the classical teleportation discussed 
above was “true” teleportation. It is possible that additional mathematical structures besides 
measures and density matrices (see 13^ above) can be used to calculate probabilities in 
Bayesian networks and that these are found to be useful in practice-then the current belief 
that quantum telportation is “true” teleportation will also be shown false. 


6.3 Bell’s inequality for Bayesian networks without 
metaphysical limitations 

Introduction 

The standard proofs of Bell’s inequality make assumptions based on assuming the un¬ 
derlying reality of hidden variables, which in our language is equivalent to the existence of 
transition probability function. As we explored in §2.2| , this is a metaphysical notion for 
hidden nodes. For our Bayesian networks, no such limitation is placed, which enlarges the 
space of possible maps that can be employed. Although, as we already mentioned in §2.2[ 
these additional maps are not necessarily of great interest, we would like to show that Bell’s 
inequality still necessarily holds. 


Set-up 


Following Clauser, Horne, Shimony, and Holt 15 , consider the case of two observers, Alice 
and Bob, who are at distant locations. Each has a box, connected by a long cable to the 
other, with a dial and two lights, marked zero and one, which light periodically. They 
record their observations, then come together later to compare notes. They hnd that the 
flashes are independent in time and depend on the settings of the dial, a for Alice and f3 for 
Bob, but for each round the lights they each saw were not independent of each other, but 
had a joint probability distribution given by the four functions of a,/?: Prob ((0, 0)|a,/d), 
Prob ((0, l)|a,/d), Prob ((1, 0)|a,/d), and Prob ((1, 1)|q;,/ d). Presenting the information in 
this manner corresponds to the graphical model: 


Alice 
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There are the additional no-signalling constraints on the probabilities arising from relativity: 

Prob ((0,0)|a, /9)-|-Prob ((0, 1)|q:, /?) is independent of/? (6.9) 

Prob ((1,0)|a, /3) -|- Prob ((1, 1)|q:, /S) is independent of 

Prob ((0,0)|a, /3) + Prob ((1, 0)|a, /3) is independent of a 

Prob ((0, 1)|q:, /3) -|- Prob ((1, l)|a, /3) is independent of a 

If these constraints were not met, then Alice and Bob could use their dial settings to trans¬ 
mit information superluminally. However, relativity places no additional constraint on the 
bounds for the value of, for a, a' two settings of Alice’s dial and (5' two settings of Bob’s 
dial. 


Prob ((0, 0)|q;, /d) -|- Prob ((1, 1 )|q;, /d) — Prob ((0, l)|a;, /d) — Prob ((1, 0)|a, /d) (6.10) 

-l-Prob ((0, 0) la', /3) -|- Prob ((1,1) |a', {3) — Prob ((0,1) |a', /3) — Prob ((1, 0) |a', {3) 

-l-Prob ((0, 0) |a, (3') + Prob ((1,1) |a, /d') — Prob ((0,1) |a, /d') — Prob ((1, 0) |a, /d') 

—Prob ((0, 0)|a', /d) — Prob ((1, l)|a', j3) -|- Prob ((0, l)|a', /d) -|- Prob ((1,0)|a', /d) 

which can still achieve its bound in magnitude arising from the rules of probability, namely 

four. 

The standard hidden variable approach 

A sufficient way to insure the restrictions from relativity are met is to restrict the possible 
graphical model to the local model: 



( 6 . 11 ) 


Then, if the space of values for the marginalized node is X and its probability measure is p, 
the joint probability both Alice and Bob get 1 is 


Prob (1 for Aliceja, x) Prob (1 for Bob|/d, x) dp{x) 


( 6 . 12 ) 


Ix&X 


with the other joint probabilities given similarly, 
inequality 


Using the standard approach for Bell’s 


15 , it can be shown that for the combination of probabilities in (6.10), the 


bound on its magnitude has now been reduced to two. 
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Using classical hidden nodes 

A sufficient way to insure the restrictions from relativity are met using classical hidden nodes 
is to restrict the possible graphical model to to the local model; 



(6.13) 


The boxes are drawn around the two separate locations to indicate these are potentially 
universal modules. 


Theorem 6.3.1 Employing either option I or II, Bell’s inequality still holds. 


Proof Using option II, by the Riesz theorem 78 , each Prob ((j, fc)|a,/d) is necessarily of 
the form 

/ fj(y;a)gk(z;a) dfi(y,z) (6.14) 

Jyxz 

for some positive, continuous functions /i, / 2 , Qi, 92 with /1 + /2 = lyx[o, 27 r), 91 + 92 = l.zx[o, 27 r) 
and some unit-norm. Radon measure fi on y x Z. This is mathematically the same form 
as (6.12), if lacking the interpretation in terms of conditional probabilities and probability 


measures, so the standard argument still applies with bound two for (6.10). 
Using option I, each Prob ((j, k)\a,/3) is necessarily of the form 


{L2Z Ls) {fi X p X 12) {{{j,k)}) (6.15) 

where p is some unit-norm measure on [ 0 , 27r) concentrated about a from a collection of 
measures AI 5 , v is some unit-norm measure on [ 0 , 27r) concentrated about f3 from the same 
collection of measures, p is some unit-norm measure on y x Z from some collection of 
measures AI 4 , L 2 is a norm-preserving (on the positive cone) map from measures on [ 0 , 27r) x 
y in some collection of measures A42 to measures on { 0 , 1 } with set algebra the power set 
{ 0 , { 0 }, { 1 }, { 0 , 1 }}, and L 3 is a norm-preserving (on the positive cone) map from measures 
on ^ X [0, 27r) in some collection of measures AI 3 to measures on {0, 1} with set algebra the 
power set {0, {0}, {1}, {0,1}}. From the rules for option I, all the collections of measures are 
absolutely-continuous-complete and are such that + 2 ®+^, is well-dehned. However, by A1.2, 
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A1.3, B1.3, B1.4, and B2.5, this implies the bound can be established just be considering 
Prob ((j, k)\a, (3) of the form 


(Ls ® ^3) ( P X I ({(i> ^)}) 


(6.16) 


where 'Yhi R x is a unit-norm, hnite-tensor-rank measure in AI 2 ® AI 3 . Using this form 


for Prob ((j, /c)|a,/3), the usual bound of two readily follows for (6.10). □ 


Comments 

Using the well-known quantum model of Clauser, Horne, Shimony, and Holt (which 
reuses some of the modules from the double-slit experiment (see ^4.5 ) in a different arrangement- 
an example of u nivers ality), it is possible to violate Bell’s inequality and the combination of 


probabilities in (6.10) can even achieve Tsirelson’s bound of 2 a/ 2 14 . Any classical hidden 


model duplicating these results must be nonlocal; hence, contextual-certainly the modules 
we used for the classical hidden model for the double slit experiment (see (4.7) would be of 
no use. 

We then have the question of whether Tsirelson’s bound can be broken. This may 
be possible if there are additional mathematical structures besides measures and density 
matrices that can be utilized in Bayesian networks while respecting the principles of positivity 
and potential universality (see (|3.8| above). 
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Part II 

Parrondo’s paradox and a 
Parrondo-like paradox 
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Chapter 7 

A Parrondo-like paradox for an 
one-round game 

7.1 Defining the game 

Instead of the multi-round game considered in the preceding chapter, now consider a single 
round quantum game where the winning criteria is still taken to be that the observation is 
in some specified set A. Instead of having two maps which are combined by a coin flip, the 
map is now fixed and only the initial states are varied. If they were combined in a convex 
combination by flipping a coin, there would clearly be no Parrondo’s paradox since the prob¬ 
abilities depend linearly on the initial state. However, if the initial states were constrained 
to be rank one (hence, described by a wavefunction), instead of a convex combination, one 
may consider the minimizing geodesic joining them. If two initial states both give probabil¬ 
ity greater than one-half for the hrst player, Alice, to win, but there is somewhere on the 
minimizing geodesic where the second player. Bob, has probability greater than one-half to 
win, we will term that a Parrondo-like paradox. 

Suppose there is a continuous control, say a slider with continuous values from zero to 
one, which varies the initial state along a geodesic in the space of rank-one density matrices. 
Using the Bayesian network model, this initial state is input into a fixed pince-nez map and 
the outcome observed. The graphical model is: 



Employing option I’ (see ^3.2), the data for the initial node is ([0,1], Lebesgue; H; iP) for 
some given Hilbert space H. The map K will be determined below in §7.2 The data for the 
pince-nez is (H; 7Z, r;; L) for some given observable set IZ with base measure r. By the rules 
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of the game, it is possible to reduce to the case where the a-algebra is {0,A,A,1Z} with 
base measure r assigning both A and A the value one. Suppose L is also given. 

The probability for the observation to be in set A (so Alice wins) is then 

f (LoK)iydT = ((LoK)iy)jA (7.2) 

Ja 

7.2 Defining geodesics on the space of wavefunctions 

Let Sh C H be the unit ball. For any (,rj & S, let ~ be the equivalence relation ( t] if 
( = UT] for some phase a; G C C and let [■] denote the equivalence classes. Consider a 
wavefunction G Sh- Since only the rank-one density matrix 'ip'if)* = ip{-, ip) is meaningful, ip 
is only dehned up to an overall phase, so what should actually be considered is the equivalence 
class [ip] in the quotient space Sh/ The question is then what is the correct topology and 
metric structure to place on this quotient space. 

Choice one—using the trace norm 

Using the topology and metric structure inherited from the placement of rank-one density 
matrices within all density matrices, consider the choice of metric disttrace on Sh/ ~ given 
by, for any [^], [ip] G S/ ~, 

disttrace ([^], M) = U = 2y^l - \{lp,Of (7-3) 

The last equality holds since spanc/.^, t/} is an invariant subspace of the operator 

(er - i^rr (er - i^r) = (er - (7.4) 

with eigenvalue 1 — |('^,.^)|^ while the orthogonal subspace is the kernel of the operator. 

Choice two—using the round metric on Sh 

Let distround be the round metric on Sh, which is the standard metric induced by its embed¬ 
ding in H equipped with its norm, so, for any ^,77 G Sh, 

distround (C, v) = arccos 3fJ(C, v) (7-5) 

Then let the metric dist....round on Sh/ ~ be given by the usual prescription for quotient 
spaces, so, for any [,^], [ip] G Sh/ ~, 

dist....round([C], [t/]) = min distround(C;= arccos max = arccos 1(77,01 (7-6) 

ineSicC tueSicC 

Then 

disttrace ([0! [0]) 2 siu dist...jround ([0 ) [0]) 

dist^round([0> [0]) = arcsin ^disttrace([0> [0]) 

In particular, the metrics are equivalent and give rise to the same topology on Sh/ ~. 


( 7 . 7 ) 
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Geodesics on the quotient space 

Fix any G Sh with corresponding equivalence classes [■C], [t/’] ^ ^h/ Let ^ be an 
element of [^] such that distround(C,= dist^round([^], M) so I = 

if 7^ 0 while ^ can be any element of [.^] if ('?/’, 0 = 0- Now take the arc 

\W:0\ 


0,distround(^,^) 


7 : 

parametrization, given by 


—)■ Sh of the great circle in Sh connecting tjj and ^ with the standard 


7 ( 6 ') = (cos 9)ip + (sin 9) 


i- 

U-{L 


(7.8) 


where 


= a/(C- = yf “ 


Proposition 8.2.1 The curve [7] in Sh/ ~ is an unit speed geodesic with respect to the 
metric dist....i.ound- 

Proof For any 9 G (^ 0 , distround(^, , 

dist^round([ 7 (^)], bP]) + dist^round([^], [7(6')]) = distround(7(6'), ^) + distround(^^ ( 7 - 9 ) 

disti-ound('C) t/) 

dist ...around (['C]) [1/]) 

SO not only is the curve [7] a geodesic, but by further partitioning, the metric is seen to be 
precisely the arclength of the geodesic. Also, for any 9 G ^ 0 , distround(C) t/) j > 

1 1 lei 

lim - dist....round([ 7 (^+ £)], [7(^)1) = lim - arccos (cos(e)) = lim — = 1 ( 7 . 10 ) 

£—>■ 0 + E £—>■ 0 + £ £^ 0 + £ 

SO the curve is unit speed. □ 

Proposition 8.2.2 The curve [7] in Sh/ ~ is a geodesic with respect to the metric disttrace- 

Proof Since 2sin;2 = 2z + O(z^) as z 0, the arclength of a curve with respect to the 
metric disttrace is simply twice the arclength with respect to the metric dist....round- Hence, 
the two metrics have the same geodesics.□ 



CHAPTER 7. A PARRONDO-LIKE PARADOX EOR AN ONE-ROUND GAME 


70 


Completing the definition of the game. 


We still need to define the initialization map K from §7.1 which uses the slider position to 
determine a point on the geodesic. From the preceding results on geodesics, K is determined 
by hrst fixing the endpoints [.^], [ip] G Sh/ Let 6 = dist..^round(['C], [ip])- Then, for z/ a 
measure on [0,1] absolutely continuous with respect to Lebesgue measure A, 




'xe[o,i] 


'j{x6)'y{x6)* du^x) 


(7.11) 


If z/ is sufficiently concentrated, then, as desired, the image of K is approximately rank-one. 
The complication of dealing with a concentrated measure z/ for the slider position is another 
instance of the previously mentioned problem (see ^3.2 and |:4.5) of inputting parameters 


encountered when employing option I’-the value of the slider cannot be read in directly. 
For purposes of simplicity and clarity, in the remainder we will instead simply suppose we 
can directly select the desired point on the geodesic and work with strictly rank-one density 
matrices. 


7.3 Bounds on the extent of the Parrondo-like 
paradox 

To quantify the extent of the Parrondo-like paradox, by analogy to §?? we have Pa, P^, 
as the probability Alice wins with initial wavefunction [^] and [ip] respectively. Define P|®° 
as the probability Alice wins with initial wavefunction for a specified point on the geodesic 
joining [^] and [ip]. Analogously to definition 7.5.2, we have the following: 

Definition 8.3.1 The quantum allowed region, denoted Q{A,1Z,t,H), is the set of all 
(Pa, P(i, Pa^°) G [0,1]^^ that occur-for fixed A, 77, r, and H -over all allowed pince-nez maps 
L, initial wavefunctions [.^] and [ip], and points on the geodesic joining them. 

The paradox can occur if Q intersects the cube (|, l] x (4^ l] x [O, 4). We have the following 
theorem giving Q precisely: 

Theorem 8.3.2 If the Hilbert space H is nontrivial, Q is the closed region 

max{0. Pa + P; - 1} < PT < min{PA + P^, 1} 

Proof Since the bounded operators are dual to the trace-class ones, there is some self- 
adjoint operator rj G P(H) such that T(p)|a = tr TIP for every p G P(H)+ and, by B5.15 and 
B5.16, every p G iSi(H). The condition on rj is that it is in the order interval 0 < p < Iy\. 
Let the 2 x 2-, Hermitian matrix B = [bjk] be given by 


■ HTpr)\A 

L(<i’h\A ■ 


{pip,ip) 

1 - 

. L{ir)\A 

i(«')U . 


. (vLtP) 

(7^,0 . 


(7.12) 
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Let B be the set of all such matrices over 0 < 77 < Jh and 7 /’, ^ G Sh- 

To get a simpler characterization of B, let C be the union of order intervals of 2 x 2-, 
Hermitian matrices given by 


U (“SC'S 

5e[0,f] L 


1 cos (5 
cos S 1 


(7.13) 


The claim is that B = C. To see this is true, take any B E B. Since ?7 > 0, for any a,b E C, 


a b B 


= a b 





a 


b 


= {T]{ail) + bCj,ai) + bi) > 0 

(7.14) 


Therefore, B >11. Similarly, since /h — "7 > 0, for cos 5 


a b 


1 cos 5 
cos S 1 


-B 


a 

b 


= a b 


{{Ih- r|)^p,^p) 



a 


b 


(7.15) 


= ( (^H - + bi),a'ip + bi 

for 6 = aTCCOs{'ip,^) E 


1 cos 5 
cos S 1 


1 cos 5 
cos 6 1 


which is always greater than or equal to zero, so B < 

[0, |]. Hence, B C C. 

Now take any C G C, so there is some 6 E [O, |] such that C < 

Since H is nontrivial, it has a pair of orthonormal vectors, {61,62}. Take ip = ei and 
= cos(56i + sin(562, which are both clearly of unit norm. Take the operator rj to be 
zero on the complement of the span of {61, 62}. On the span, using {61, 62} as the basis, let 
f] be given by 


(7.16) 


C = 

L 

7] = 

Then, since C > 0, clearly 77 > 0 . Since C < 
[a 6 ] (Jh - 77) 

Hence, 77 < Jh, so C C H and B = C. 



(#,d 


■ 1 o' 

77 

1 cos (5 




cos (5 sin (5 

0 sin 5 


1 

0 

-1 

C 

■ 1 

cos 6 

cos (5 

sin (5 

0 

sin (5 


-1 


a 


b 

= v Q 


1 cos (5 
cos 6 1 

1 cos (5 
cos (5 1 


, for V = 


1 cos S 
0 sin 6 


-1 r 


a 

b 


— O V > 0 


(7.17) 
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Therefore, using the above expression (7.8) for the geodesic and ||^ — = sin^, 

we wish to extremize 


pr = 


1 


sin"^ 5 


[ sin(5 — 9) sin 6 * ] B 


sin((5 — 6) 
sin 6 * 


(7.18) 


over all 0 < 5 < 


1 cos S 
cos 5 1 

Setting the imaginary parts of t 


, 5 G [0, |], and 9 G [0, 5] for hxed Pa = &ii and P'a = b 


22 - 


le off-diagonal entries of B to zero keeps B in the allowed 


order interval and does not change the value of Pa, P'a, or ^ taken real. 


For hxed 5 and 9, the expression in (|7.18|) is linear in B. Since the allowed B form a 

points, so B can be restricted to 

B be rank one. For case (i), we 


convex set, the extrema are achieved on the set of extreme points, so B can be restricted to 

1 cos 5 
cos 6 1 

have two subcases for the choice of either -|- or — in 


either (i) being rank one or (ii) having 


B = 


bn 

±Vbnb22 


± a /6 ii 622 

^22 


(7.19) 


Choosing the -f, then bn, &22 G [0,1] with 


max{ 0 , \/bnb22 - a /(1 - 6ii)(l - &22)} < cos <5 < min{l, \/h^2 + \/(l - - &22)} 

( 7 . 20 ) 

and 9 G [ 0 , 5 ]. Let /™'^^( 5 ii, ^22) be the maximum of ( 7 . 18 ) over all allowed 5,9 for the 
given 611, &22 and /™“(5ii,522) be the minimum. Then /™^""( 5 ii, ^22) = min{6ii -|- 622,1}, 

with the maximizing 50,6^0 given by 5 o = f, sin 6^0 = \J bil+b22 + 622 < 1 and by 

cos 5 o = V611622 —\/ (1 — 6ii)(l — 622), sin 00 = a/I — 611 if 611-I-622 > 1 - The minimum bound 
is /™“(6 ii, 622) = niin{6ii, 622}, with the minimizing 5 o, 9 o given by 0o = 0 if 611 < 622 and 
00 = 5 o if 611 > 622, with 5 o arbitrary. Choosing the —, then (611, 622) G [ 0 , 1 ]^^ fl {611 -|- 622 < 
1} with 

0 < cos 5 < min{l, y/(l - 6ii)(l - 622) - ^611622} 


(7.21) 


and 0 G [0, 5]. Let /™^^( 6 ii, 622 ) be the maximum of (7.18) over all allowed 5, 9 for the given 
611,622 and /™“( 6 ii, 622 ) be the minimum. Then /™’^^( 6 ii, 622 ) = max{ 6 ii, 622 }, with the 
maximizing 5o, ^0 given complementary to that for the preceding /™“. The minimum bound 

is /™“( 6 ii, 622 ) = 0 , with the minimizing 5o, 6*0 given by 5o = |, sin 0 o = 


1+^22 ' 


For case fiij, once again we have two subcases for the choice of either -|- or — in 

;5 ± a/(1 — 6 ii)(l — 622 ) 


B = 


ni 


COS( 


_ cos5 ± ^(1 - 6 ii)(l - 622 ) 622 

Choosing the -|-, then ( 611 , 622 ) G [0,1]^^ fl {611 -|- 622 > 1} with 


(7.22) 


0 < cos5 < min{l, ■\/bnb 22 — - 611 


(7.23) 
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and 9 G [0, 5]. Let ^ 22 ) be the maximum of (7.18) over all allowed 5, 9 for the given 

611,622 and ^ 22 ) be the minimum. Then /““(& 11 , ^ 22 ) = 1, with the minimizing ^0 

given by 60 = sin^o = \J 2-b^l-i^ - The minimum bound is /™™(&ii,&22) = niin{6ii,622} 
with the minimizing 6 o, 9 o given similarly to that for the preceding /™°. Choosing the —, 
then bii, 622 ^ [0,1] with 


max 


{ 0 , \/(l - bn)(l - ^ 22 ) - a/^ii& 22 } <cosS< min{l, \/(1 - - 622 ) + V^J^} 

(7.24) 


if 


and 9 E [0,5]. Let ^ 22 ) be the maximum of (7.18) over all allowed 6,9 for the 

given 611,622 and /™™( 6 ii, 622 ) be the minimum. Then /™’^""( 6 ii, 622 ) = min{ 6 ii, 622 } with 
the maximizing 605 9o given similarly to that for the preceding The minimum bound is 

/““( 611 , 622 ) = max{ 0 , 611 + 622 — 1}, with the minimizing 60 , 9o given by cos5o = \/l — 611 

VI - 622 - V 611622 , sin 6^0 = a/ 6 V if 611 + 622 < 1 and 60 = f, sin 610 = \J 
611 + 622 > 1. 

Putting the cases and subcases together, the minimum value of for hxed Pa, P'a is 
given by 

(/r A A fr A /f") {Pa, P'a) = ff^PA, P'a) = max{ 0 . Pa + Pa - 1} (7-25) 

The maximum value of P®‘^° for hxed Pa, P'a is given by 

(/r“ V /r^ V v /rv {Pa, p'a) = fr^PA, p'a) = min{PA+pv i} (7.20) 

With hxed values for Pa and P)i, within each case and subcase P|®° is a continuous function 
of 6 and 9, so all intermediate values for P^° are achieved. □ 

Note that, by the result of the preceding theorem, the paradox can only occur for values of 
(Pa, P'a) in the triangle bounded by Pa > \, P'a> and Pa + P^ < §• 


7.4 Conditions for the occurrence of the 
Parrondo-like paradox 


The matrix B dehned in the proof of the preceding theorem (7.12) puts restrictions on the 


occurrence of the paradox. If P < ^l 2 then both Pa and P'^ are less than or equal to one- 
half, so the paradox cannot occur for any choices of [ijj], [^], or point on the geodesic joining 
them. Similarly, if the trace of B is less than or equal to one, then either Pa or Pj^ is less 
than one-half, so the paradox also cannot occur. From these obvious statements, we then 
have the following nontrivial result: 


Theorem 8.4.1 If, for any particular orthonormal ei,e 2 G H, 


tr 


L{eiel)\A L(eie2)|A 
L{e2el)\A L(e2e2)jA 


< 1 
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then, for any normalized ^ span£{ei,e2} and point on the geodesic joining [ip] and [^], 
the paradox cannot occur. 


Proof Let ip = Oi 62 


a 

b 


and ^ = Oi 62 


d 


with |a|2 + |6|2 = |c|2+ |d|2 = 1 


and ac + bd = cos S real and greater than or equal to zero. Let 

C' = 

If C < I/2, then 


L{eiel)\A L(eie2)|A 
-h(e2ej)|A ^(6262)^ 


tr i? = tr 


a b 

C 

a 

c 

c d 

b 

d 


< 1 


( 7 . 27 ) 


( 7 . 28 ) 


so, by the comment preceding the theorem, the paradox cannot occur. Therefore, the only 
remaining case is where C, whose eigenvalues are necessarily real, has one eigenvalue, Ai > 
and one eigenvalue, A2 < with Ai + A2 < 1 . There are corresponding normalized 


eigenvectors vi and V2, necessarily orthogonal. Writing 


a 

c 


= /ivi + /2V2 and 


b 

d 


91^1 + 92^2, the above conditions on a, b, c, d become the following conditions on /i, /2, 91,92'- 
l/iP + I/2P = 1, Ifi'iP + |fi'2p = 1, and fiQi + f2g2 = cos 5 is a positive real or zero. 

For the paradox to occur, it must be that both |/ipAi + I/2PA2 = bn = Pa > ^ and 
Ifl'ipAi + |5'2pA2 = 622 = Pa> I', hence. 



l/i|> 


A 


f 1/2 


Then, since 


\ 9 i\ > \l\ -^ \92 


\ fi 9 i \ = \ fi \\ 9 i \ > y-^ 1/211^2! > 1/211^2! = 1/2^2! 

Ai — 5 


( 7 . 29 ) 

( 7 . 30 ) 

( 7 . 31 ) 


it must be that figi + f292 is actually strictly positive and that ^fi9i > 0 . Furthermore, 
since the imaginary parts of /i^ and /2^ are equal in magnitude, it must be that the real 

part of fidi is greater than times the magnitude of the real part of /2^ and, therefore, 

Ai- 


is greater than 


I-A2 


X times the real part of f292- Hence, rearranging terms, 

Ai — 2 

(-^i/ifi'i + ^ 2 / 292 ) > - (/ifi'i + 1292 ) = 2 cos 5 


( 7 . 32 ) 
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However, then 


pgeo _ 

^ sin^ S 


sm(S — 0) sin0 B 


sin((5 — 6) 
sin0 


sin"^ (5 


((l/il^Ai + |/2|^A2) sin^{6 -6) + (|^i|^Ai + |^2rA2) sin^ 6 


+23? {Xifigi + A 2 / 2 fi' 2 ) sin(5 — 0) sin 0) 

> —^ ( - sin^ (5 — 6*) H— sin^ 0 + cos <5 sin((5 — 0) sin 0 
sin 0 \ 2 2 


-^ (sin((5 — 0) (sin((5 — 6^) + cos 5 sin 0) 

2 sin 0 

+ sin 0 (sin((5 — (5 — 6 *)) + cos 5 sin((5 — 0 ))) 


-^ (sin(5 — 0) sin 5 cos 6* + sin 6* sin 5 cos((5 — 0)) 

2 sin 0 

sin^ 5 1 

2 sin^ 5 2 


(7.33) 


Therefore, if both h\\ — Py^ + ^ and &22 = P'a > then > I everywhere on the 
geodesic. □ 



76 


Chapter 8 

Quantum walks and the Parrondo-like 
paradox 

8.1 Classical random and classical hidden walks 

Definitions of classical random and classical hidden walks 

A classical random walk is a special case of the observable Markov chain earlier discussed, 
with graphical model given in hgure (??)• It models a walker who is in one of a hnite 
number of internal states and occupies one of a countable number of positions at one time, 
so the space X is either J' x Z"*" for a walk on the half-line, or 77 x Z for a walk on the 
full line. After each time-step, the walker is at the same or a neighboring location and its 
internal state can change as well. The process is random, with the transition probability 
functions possibly dependent on the internal state as well as on the location. This constrains 
the transition probability functions further than the constraints already imposed by the 
Markov conditions, but these further constraints are not indicated in the graphical model. 
If the transition probability functions are spatially translation invariant, the walk is termed 
homogeneous. 

Now we may consider a classical hidden walk. This is in some ways a special case of the 
classical hidden-Markov chain earlier discussed (??), while in other ways it is a generalization. 
The hidden set "H is required to be either 77 x Z"*" for a walk on the half-line, or 77 x Z for 
a walk on the full line. The base measure for the hidden set is required to be the counting 
measure, and (77; counting measure) = so all integrals can be taken to just be sums. 
The space has a natural basis of sequences that have a single nonzero entry with value one. 
There is a dual “basis” of sequences in of the same sequences, which is not a basis in the 
norm topology, but is one in the weak* topology. This basis and dual “basis” can be used to 
assign matrix elements to any operator in Then any operator. A, in B{i^) is in one- 

to-one correspondence to a certain sequence of matrices, {An}, which may be considered the 
truncations of the inhnite matrix corresponding to the operator. By the triangle inequality. 
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these sequences converge to the operator in the strong-operator topology: 


lim IIAx — A„x|| = lim ||Ax — P„AP„x|| < lim (||Ax — P„Ax|| + ||P„Ax — P„AP„x||) 

n—>-cxD n—^oo n^oo 

( 8 . 1 ) 

< lim (IKAx) - P„(/lx)|| + ||P„/l||„p||x - P„x||) = 0 

n^oo 


for any x G where {Pn} are the usual, diagonal projectors onto the span of the hrst n 
basis elements. 

Conversely, given a sequence of matrices, each of which is the truncation of the following, 
the condition on the sequence so that it actually corresponds to a bounded operator is that 
the induced operator norm of all the matrices is bounded. However, the operator norm 
induced by the U norm is simply the supremum over all columns of the column sum of 
the magnitudes of the entries, so it is readily calculated. With this form for the maps, the 
condition that this is a walk (rather than some other sort of hidden-Markov process) is that 
matrix entries connecting spatial locations that are not neighboring are all zero. 

The generalization from the classical hidden-Markov chain is that the last pince-nez map 
in the chain is no longer required to be the same as the preceding ones; in particular, all the 
preceding pince-nez can be taken to be simply nodes, so the graphical model is 


1 2 
D-O-O 


O 


n -I- 1 



( 8 . 2 ) 


Note there is nothing graphically that distinguishes this from a more general Markov process- 
the constraints that make it a walk are not represented graphically. 


Connection to orthogonal polynomials and measures on M 

Orthogonal polynomial^ {pj} result from the Gram-Schmidt algorithm applied to {1, x, ... 
on the real line with inner-product given with respect to some Borel measure /i, (/, g) = 
/r f9 dfi- These polynomials all have the maximal number of real roots, which are all simple, 
else they would not change signs enough times to be orthogonal. For the same reason, the 
roots are all within the convex hull of the support of /i and they interlace as j increases by 
one. By orthogonality, the polynomials necessarily obey a three-term recurrence relation, 
which may be written in matrix form as 


x[po Pi ■■■] = [ Po Pi ■■■ ] 


bo Cl 
Oo bi C2 

ai 


(8.3) 


^The following results on orthogonal polynomials are well known and included for comparison to the 
results for quantum walks given below. For details, see 


119 
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If the polynomials are normalized to have valne one at x = 1, the tridiagonal, inhnite matrix 
on the right has colnmn snm one for each of its colnmns. If /r is snch that all the entries 
in that matrix are positive, then it is a stochastic matrix and can be nsed for the map per 
time step for a classical random or classical hidden walk on the half-line where the internal 
states are trivial. 

Conversely, given snch a walk, then there is an inhnite, stochastic matrix 

■ bo 
ao 


Cl 

bl C2 

(Xl 


(8.4) 


giving the map per time step. If all the a’s and c’s are strictly positive, then each n x n- 
trnncation An of the matrix is similar to a Hermitian matrix via 


di 


(^n — 1 - 


bo Cl 
ao bl 

ai 


C 2 


^n—l 
^n—2 bji—i 


di 


dn—1 


(8.5) 


where dj = 


. Therefore, for each snch trnncation, the eigenvalnes ... ,xi”H are 
all real. These eigenvalnes are all necessarily less than or eqnal to one in magnitnde since 
the spectral radins of An is less than or eqnal to its operator norm indnced by the £^-norm, 
which is one. By the Conrant-Fischer minimax theorem 16 the eigenvalues are all simple 


and they interlace as n increases, so they are actually all less than one in magnitude. Dehne 
the polynomials {pn} by 


Pn{x) = 


det{xln - An) _ (x - x^""^) • • • (x - xJTO 
det(/„ - An) (1 _ xf^) • • • (1 - xi"^) 




( 8 . 6 ) 


By adding the hrst row to the second, the second to the third, and so on, it is easy to see 
that det(/n — An) = aoai ■ ■ ■ a„_i. Then, by expanding det(x/„ — An) by minors along its last 
column and evaluation at the n values x G {x^^\ ..., xi"^} (which is enough to determine a 


degree-n polynomial), the polynomial xpn-i(x) obeys the recurrence relation in (8.3). 

Let pi be the single atom measure 6bo and, for n > 1, let pn be the atomic measure 

E n 

i=i 


where 


(n) 

w) = 


E 

1=1 


w 


(fc) 


Pn X'l 




X 


(k) 


— X 


(n) 


P I X 


SO 

T 


(8.7) 
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for any k G {["l],...,??. — l} (they all give the same result). Furthermore, the sequence 
of measures stabilizes for any fixed polynomial in the sense that for any degree-n 
polynomial q, q d^k is the same for all k > [f + ij • Therefore, the fin are indeed measures 
(and not just signed measures) since 



n —1 



W 


(n-l) 

I 


1=1 


Pn 



X 


(n-l) 

I 



p' 



2 


( 8 . 8 ) 


SO all the tc’s are positive. 

Hence, /in(lR) = /jj 1 dpn = 1 is the total-variation norm of pn for each n. Since the pn are 
all Radon measures and the interval [—1,1] is compact, by the Riesz-Markov theorem 78 and 
Alaoglu’s theorem [^, the sequence (p„) has a weak* limit point. Since, by the Weierstrass 
theorem 112 , polynomials are dense in the supremum norm among continuous functions 


on the compact interval [—1,1], by the above stabilizing property of the sequence, the limit 
point is unique and the entire sequence converges to it in the weak* topology. Let this limit 
be denoted p. Then {pn} are the orthogonal polynomials corresponding to the measure 
fi on M. The measure p is unique among Radon measures since any other measure with 
this property agrees with p on polynomials, but, as stated above, they are dense in norm 
among continuous functions on [—1,1], which separate Radon measures; hence, polynomials 
separate these measures as well. 

Furthermore, one may ask if, starting with u such that the tridiagonal, infinite matrix 
has all positive entries, then forming the measure p following the procedure outlined, it is 
necessarily the case that u (x p (there may be a scale factor since p necessarily has total- 
variation norm one). This is true since p and v agree (up to the scale factor) when integrated 
with polynomials on M; however, p is supported on [—1,1] so its moments (and, hence, those 
of v) are bounded. Therefore, by 22 , the moment problem on M has a unique solution in 


this case. 

Finally, there is the question of whether every Radon measure on [—1,1] corresponds to a 
classical random walk. The answer is no, as is seen by the Jacobi polynomials [^, normalizecQ 


to have the value one at x = 1, 


(!+«)■ 


-Pr 




These have measure with support on [—1,1], 


given there by the measure absolutely continuous with respect to Lebesgue measure and 
with Radon-Nikodym derivative (1 -1- x)^(l — x)". For the recurrence relation, the entries in 
the tridiagonal, inhnite matrix for {a„} and {c„} are always strictly positive (as they must 
be for any measure with its support on (— cxd, 1 ]), but the {6„} are given by 

(/S^ — q;^)(2?7, + a + jd + T) 


kn. 


(8.9) 


(2n -\- Q. jd')^ 

which are negative for a > (d. (However, for the Jacobi polynomials shifted to live on the 


interval [0,1] with Radon-Nikodym derivative x“(l — x)^, 
an associated walk-see (^). 


i-l)"n\ p{a,/ 3 ) 
(l+/3)n 


( 1 - 


there is 


^Using Pochhammer’s symbol, (a)o = 1 and (a)„ = a{a -I- 1) • • • (a + ’^ — !)• 
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8.2 Quantum walks 

Definition of a quantum walk 

Similarly to the classical hidden walk, the quantum walk is in some ways a special case of 
the quantum Markov chain earlier discussed (??), while in other ways it is a generalization. 
The Hilbert space H is required to be either x counting measure) for a walk on 

the half-line, or L‘^[J x Z; counting measure) for a walk on the full line; both are clearly 
isometrically isomorphic to C, so all integrals can be taken to just be sums. The space C has 
a natural basis of sequences that each have a single nonzero entry with value one. This basis 
and the inner-product can be used to assign matrix elements to any operator B{C). Any 
operator A in B{C) is in one-to-one correspondence to a certain sequence of matrices, {An}, 
which may be considered the truncations of the infinite matrix corresponding to the operator. 
By the triangle inequality, these sequences converge to the operator in the strong-operator 
topology: 

lim \\A7p - Ani>\\ = lim \\A 7 p - P^AP^V’ll < 1™ {\\A 7 p - PnA7p\\ + llPnAi/j - PnAPn^{>\\) 

n^oo n^oo n^oo 

( 8 . 10 ) 

< lim (Il(.4^') - Pn{Ai^)\\ + ||P„/l||op|K’ - Pni’W) ^ 0 

n^oo 

for any -i/i G C, where {Pn} are the orthogonal projectors onto the span of the first n basis 
elements. 

Conversely, given a sequence of matrices, each of which is the truncation of the following, 
the condition on the sequence so that it actually corresponds to a bounded operator is that 
the induced operator norm of all the matrices is bounded. The operator norm induced by the 
C norm is the largest singular value, which, unfortunately, is not generally readily calculated. 
However, for this sequence to correspond to a partial isometry, it is only necessary to show 
that for all fixed, finite collections of columns, those columns of the {An} are mutually 
orthonormal in the limit as n —)■ cx). It is readily shown that this condition implies the 
induced operator norm of each An is less than or equal to one. With this form for the maps, 
the condition that this is a walk (rather than some other sort of quantum Markov process) 
is that matrix entries connecting spatial locations that are not neighboring are all zero. 

The generalization from the quantum Markov chain is that the last pince-nez in the 
chain is no longer required to be the same as the preceding. In particular, all the preceding 
pince-nez can be taken to be simply nodes, so the graphical model is 


1 2 

D-O-O 


O 


n -I- 1 



( 8 . 11 ) 


Note, as for the classical random walks, there is nothing graphically that distinguishes this 
from a more general quantum Markov processMhe constraints that make it a walk are not 
represented in the graphical model. 
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Orthogonal trigonometric polynomials and measures on 

Trigonometric orthogonal polynomial^{gj } resnlt from the Gram-Schmidt algorithm applied 
to {l,z, z~^, z^, z ~^,...} on the nnit circle within C with sesqnilinear inner-prodnct given 
with respect to some Borel measnre /r, {f,g) = /§i fgdg,. To form these, it is nsefnl to start 
with the monic orthogonal polynomials on the nnit circle, the Szego polynomials, {s„}. For 
any polynomial, dehne the reciprocal polynomial to be the polynomial with its coefficients 
conjngated and flipped in order, so, if p is a nth order polynomial. 


P 


reciprocal 




( 8 . 12 ) 


where the overline only over the fnnction means to conjngate its coefficients. On 
^reciprocalparticffiar, if z is a root of p, then - is a root of 

Z 

Dehne the Verblunsky coefficients by the values of the Szego polynomials at zero, an = 
—s„+i(0). Then, by orthogonality, we have the following Szego recurrence identities for all 
e C: 

2 s„(z) = ii„+, (z) + a;; ) (8.13) 

and 

Sn{z) = + (1 - \an-l\^) z Sn-l(z) (8.14) 

^ 4 ®ciprocal^^^ ^ -an_i Sn{z) + (l “ lOn-lP) 

From the hrst identity, it follows that 

ll-Snir = / |s„(z)|^ d/i(z) = (l - |an_ip) ■ ■ ■ (l - |ao|^)/i(S^) (8.15) 

JzeS^ 


By Verblunsky’s theorem, the measure p, the moments of the measure {mj}, and the 
Verblunsky coefficients {aj} all determine each other. The only condition on the Verblunsky 
coefficients that they do indeed correspond to some measure is that \aj\ < 1 for all j. 

The monic orthogonal trigonometric polynomials are then given by go = I, and, for 
je{l,2,...}, 

qj{z) = z ^h 2 j-i{z),q_j{z) = z ^ (8.16) 


^Only a few basic results that are most applicable to quantum walks of this rich topic are presented here. 
See 117 for details and elaboration. 
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The orthogonal trigonometric polynomials necessary satisfy a pentadiagonal recurrence re¬ 
lation, with first zqo(z) = gi(z) then, using (8.13) and (8.14) repeatedly. 


zgj(z) = z ^z^S 2 j-i(z) = z ^z (^S 2 j(z) + (8-17) 

— j( / \ I - reciprocal / \ . - reciprocal / \ \ 

= z ^ (^S2j+i(2:) a2jS2j O) + a2j-iZS2jTi (zjj 

= Z ^ [S 2 j+l[z) + a 2 jS 2 j ^ (z) 

+za 2 j-l ^-a 2 j- 2 S 2 j-l(z) + (l - \a 2 j- 2 \‘^) 

= gj+i{z) + a^g_j{z) - oU~{oi 2 j- 2 gj{z) + aXJW (l - \a 2 j- 2 \^) g-[j-i){z) 

and 

zq_,(z) = (- 02 ,S 2 , + (l - (8.18) 

— if ( I - reciprocal \ 

— Z ( —Q;2j-1 ( S2j+1 + 0'2jS2j J 

+Z (l — \a2j-l\^') (y—Ol2j-2S2j-l + (l ~ |'^2j-2p) ) 

= -a2j-igj+i{z) - aPja2j-ig-j{z) - a2j-2 (l - \a2j-i\‘^) gj 
+ (l - \a 2 j-i\‘^) (l - |Q! 2 i- 2 r) g-{j-i){z) 

Following Cantero, Moral, and Velazquez [^, writing this in matrix form as 

[ zgo{z) zgi{z) zq-i{z) ■■■] = [ ^ 0 ( 2 :) qi{z) q-i{z) ■■■]Z (8.19) 

gives the CMV-matrix Z for the monic orthogonal trigonometric polynomials. 


(1 — l*^o|^) 

1 

to 

1 

0 

to 

0 ■■■ ■ 

1 — ( XiCHq 

1 

1 

to 

p 

0 

0 

0 

— criQ;2 

^(l-|a2n ■■■ 

0 1 

— «! 

— 0)30)2 

0 0 

0 

04 





The matrix Z can be written as the product of two block-diagonal matrices (the columns 
within each 2 x 2-block correspond to the identities in (8.13) and (8.14)) 


'0)0 1 - |o)oP 


■ 1 

1 —0)0 


Ol 1 — 0)ip 

aS . 1 - 0)2p 


1 —0)1 

1 -0)2 


0)3 ■ ■ ■ 





( 8 . 21 ) 
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Using (8.15), the CMV-matrix 
given by 


U for the orthonormal trigonometric polynomials is then 


Op Po 


■ 1 

Po -Op 


Ol pi 

02 p2 


Pi —Ol 

to 

1 

P 

to 


Op ■ ■ ■ 





( 8 . 22 ) 


for pj = 


Connection to quantum walks—the CMV-matrix 

Consider the quantum walk on the half-line U {0} with internal state set J = {t, t}, so 
the Hilbert space H is C{Ij x {t, t})- Then, taking the indices as 0 t, 0 t, 1 t, 1 J,, 2 t,..., 
following a proposal of Griinbaum in , the CMV-matrix U above can be used as the map 
for a quantum walk. With an abuse of notation, for a quantum walk on the line Z with the 
same internal state set and with indices ..., —2 J,, —1 —1 0 t, 0 J,, 1 t, 1 2 .., the 

matrix (underlines indicate indices corresponding spatial location zero) given by the product: 


Oi-2 P-2 
P-2 —0—2 

Op 

Po 


■ ■ ■ a_3 

0-1 P-l 

P-i —O-i 0 

0 ai Pi 
Pi —oi 

as ■ ■ ■ 

will also be called a CMV-matrix and the {oj} called the Verblunsky coefficients, although 
there is no longer a connection to the orthogonal trigonometric polynomials. 


Po 

— Oq 


02 p2 
p2 —02 


(8.23) 
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Coined quantum walks 

Once again, consider the quantum walk on the half-line U {0} with internal state set 
J' = {t, 4-}, so the Hilbert space H is C(Z x {t, t}). Then, taking the indices as 0 t, 0 1 t 

, 1 2 • • •, if if is either of the form 


Go bo 
Co do 

02 &2 
C2 1^2 


0 1 
1 0 

0 


(8.24) 


or 


0 1 
1 0 

0 


Oq bo 
Co do 

02 &2 
C 2 d2 


(8.25) 


then it is termed a coined walk with coins 


02j &2j 

C2j d2j 


which are unitary matrices. The 


hrst form is a CMV-matrix if b 2 j = C 2 j is a positive real or zero and if = —d 2 f then 
all the Verblunsky coefficients with odd index are zero and the Verblunsky coefficients with 
even index are given by q; 2 j = —d 2 j- The second form is the adjoint of a CMV-matrix if the 
same conditions hold. The difference between the two forms for a quantum walk is clearly 
just a matter of transforming the initial state by the unitary matrix 


0 1 
1 0 

0 


(8.26) 


Therefore, which form is adopted is largely a matter of convention, except for possible 
restrictions on the initial state. 

If all the coins are the same, the quantum walk is termed a coined walk with constant 
coin. Since the overall phase of the wavefunction is irrelevant, the phase of the determinant 

a B 

of the coin is arbitrary. Choosing it to be —1, the coin is necessarily of the form ^ 

for some a,/3 & C with \a\‘^ + |/9p = 1. Then the condition for the unitary matrix for the 
walk to be a CMV-matrix (or the adjoint of one) is that jB is positive real or zero. Similar 
terminology may be employed for walks on the full line. 
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8.3 The Parrondo-like paradox for quantum walks 


Set-up 


Consider the quantum walk on the line Z with internal state set J' = so the Hilbert 

space H is C(Z x {t, t})- Let P+ be the orthogonal projector onto spatial locations with 
positive index, P_ be the orthogonal projector onto spatial locations with negative index, and 
Pq be the orthogonal projector onto spatial location zero, so P_ + Pq + -P+ is the identity. 
Let U be the unitary operator that gives time evolution for one time step, p —)■ UpU*. 
Then, after n time steps, an observation is made with an observation set TZ with a-algebra 
{0,A,A,TZ} and base measure r given by t{A) = t{A) = 1. The pince-nez map L is given 
by 

Lp = l^tr pP+ + l^tr p(Po -h P-) (8.27) 


Then, by considering two different initial states, 'ip'ip* and and the geodesic joining them, 
we can analyze the occurrence and extent of the Parrondo-like paradox. 


Showing the paradox is impossible for certain classes of quantum 
walks 

Let Pot Le the wavefunction with one for f at location zero and all other amplitudes zero 
and Pot Le the wavefunction with one for j, at location zero and all other amplitudes zero. 
Consider the six following cases for the Verblunsky coefficients determining the CMV-matrix 
U: (i) aj = u^a-j for some a; G C C and all j G Z; (ii) a 2 j = —Q; 2 t+i = 
for some ca G C C and all j G Z; (in) aj = aZ] for all j G Z; (iv) aj = —aZ] 
for all j G Z; (v) a 2 j = a_ 2 j, Q; 2 j-i-i = —tt- 2 j-i for ah j ^ and (vi) a 2 j = —aZZ^, a 2 j+i = 
q;_ 2 j-i for all j G Z. With the preceding set-up, we have the following result; 


Theorem 9.3.1 If the Verblunsky coefficients are in any of the preceding six cases, then 
for any initial wavefunctions in the subspace for spatial location zero, spanj.{po'|-, pot), 
the Parrondo-like paradox cannot occur. 


Proof Let = P^pot and = U^poi- Then is related to by the following, 
for each n G {1,...} and j G Z, depending on the case: 


i) 

ii) 
Hi) 

iv) 

v) 

vi) 


AD 

AD 

AD 

AD 

AD 



AD 

= u ^ 1 


AD 

T 

1 

AD 

AD 

= 

/ (D 

AD 

DZ 

= 


AD 

= 


AD 

^iZ 

= (-l)^'+>i": 




(8.28) 
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Therefore, in any of the six cases, 


L(U^r)otr)*^U*^) U L(U^r)otrj*^U*^)\A 
L(U^r]oir]*^U*^) U L(U^r]oir/*^U*^)\A 


tr tr P_^'ip(0^(0* 

[ tr tr 

(8.29) 

((Pa + < 1 


Hence, by theorem 8.4.1, for any initial wavefunctions 'ip,^ in the subspace for spatial loca¬ 
tion zero and any point on the geodesic joining [ip] and [.^], the paradox cannot occur. □ 


In particular, note this rules out the paradox for the case of constant coin walks of the 
hrst form with initial wavefunctions ip,^ in the subspace for spatial location zero where the 

\/i^ 


a 


\a\ 




a 


-a 


42 


for some a G C with 


a\ 


< 1. Using a limit 


coin is of the form 
theorem by Konno 

coin walks (not necessarily in the CMV-matrix form) adopting the second form of coined 
quantum walks: 


41 , we have the following limiting result for more general constant 


Theorem 9.3.2 If a quantum walk has constant coin presented in the second form, then 
for any initial wavefunctions ip, 8, in the subspace for spatial location zero, spanj.jr/otj hoi}; 
the Parrondo-like paradox cannot occur in the limit as u —)■ cx). 


Proof Adapting the terminology of Konno to our notation, let the coin be given by 
^ ^ . The wavefunctions T^ot r^o^. are given by /3 = l,a = 0 and /? = 0, a = 1 
respectively. Then, employing Konno’s limit theorem, the limit of the sum of probabilities. 


lim ((F+UVot, + (F+PVo^, PVo^)) (8.30) 

n^oo 


is given by 



7r(l 


2^1 - | a |2 
— x‘^)-\/\a\'^ — x'^ 


dx 


1 


(8.31) 


Hence, by theorem 8.4.1, for any initial wavefunctions ip, ^ in the subspace for spatial location 
zero and any point on the geodesic joining [ip] and [.^], the paradox cannot occur in the limit 
as n —)■ cxD. □ 


Examples of quantum walks displaying the paradox to the 
maximal extent 

Example 9.3.3 With the above set-up, take initial wavefunctions 4^ = ^ (hot + hot) 

^ (hot ~ hot)- Then, halfway on the minimizing geodesic between them, the initial 
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wavefunction is Take all the Verblunsky coefficients to be zero except for q;_i, which 
has value Let U be the corresponding CMV-matrix. 

Let = U'^iIj, = U^^, and = U^'rjo^ for n > 1. Then has all amplitudes 
zero except for for I at location n, for I at location n — 1 , and for 'I' at location 
—n; has all amplitudes zero except for for at location K for I at location 

n — 1 , and for 'I' at location —n; and has all amplitudes zero except for for 'I' at 
location —n and — ^ for at location n — 1. Consequently, Pa^u = -P^n = | for all n > 1, yet 
^Tn ~ i for the initial wavefuntion halfway on the minimizing geodesic. By theorem 8.3.2, 
this example is on the boundary of allowed values of (pA,n, P'Am ^An) for the paradox. 


walk in the second form with coin 


Take e > 0 small. Take ai > 0 sufficiently 


Example 9.3.4 Again with the above set-up, let U be the matrix for the constant coin 

1 1 
-1 1 

J 

small relative to e such that the normal distribution with mean e and variance af has 
neglectfully small measure for (—cx),0). Take a G (—f,0) and (J 2 > 0 such that: (i) the 
normal distribution with mean a and variance has neglectfully small measure outside 
(— 1 , 0 ) and (ii) we have 


/ h c^Normal(a, 02 ) (k) = ^ 

vl + cos^ fc 3 


(8.32) 


Let 99 , C : Z t C be given by 


cpj = 


2ai 


exp 


(Tfp+^0-e)j ^ 


71 


exp 


a/^ J-t. 




Tiaf 


-dk 


7R 


-|- zjk 


V^J-00 </2 

+tjkj 


77a\ 


dk 


(8.33) 


0 =r 

TT V27r J-c 


exp 


(k—a)^ 

4o-2 


- 1 - ijk 




T7al 


dk 


(8.34) 


exp 


{k-af 

4cr| 


-1- zjk 


J-7. 


^27\al 


dk 
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Then, by the inversion of Fonrier series, 

.00 exp 
jez 

jGZ 


2 (t^ 


exp 


(k-af 


dk = 1 


(8.35) 






dk = 1 


TTcrS 


/ c 


exp 


(k-a? 


2 cr^ 


4a|“ 


'y27r erf ^2TTcr. 


dk ^ 0 


Hence, if the initial wavefnnetions are taken to be ■0 = + 93 ) 


and ^ = f (C - 


(p) ( 8 ) 


, they will be properly normalized and approximately orthogonal. Midway on the 


geodesic joining them, we have X = 


Now snppose for the observations we always Inmp the internal states for any spatial sites, 
so the (j-algebra for the observations is that generated by only the spatial locations. Let 
{Qb} be the complete set of mutually commuting, orthogonal projectors for this a-algebra. 
Dehne k : M —)■ [—tt, tt] by 


k { x ) 


farccos^ if a; e [-^, ^] 
<0 ifa:e[^,cx)) 

[tt if a: e (-cx),-^] 


(8.36) 


Then, for each n, let the complete set of commuting projectors {R^c^} for the Borel a- 
algebra on [0, tt] be given by = Q(n-K-i(c))nz- By Machida’s limit theorem 
weak*-hmit of the measures = (^R'^’^'^U'^il)^ U’^ip'^ exists and is given by 


48 


49 


the 


1 / TT 

-Normal f-£, exf 

^ 2 V2 ^ 


,{ 00 ) 


If sink \ ^ ,, , 

+ d P “ , o , Normal(-a, ^ 2 ) 

4 V V 1 + COS"' k / 


(8.37) 


If sin \ 

7 U Normal( 7 r + a, aa) 

4 V V 1 + cos"' k / 


Similarly, the weak*-hmit of the measures U^0 exists and is given by the 

same expression. For both of these, there is approximately | + j (l “ |) = | probability to 
have k G [O, |), which corresponds to spatial locations with positive index j. The weak*- 
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limit of the measures = (Q^^^U^Xy U^'x) ^-Iso exists and is given by 

^ 1 A - ^ ^ Normal(-a, (T 2 ) + \(i+ ) Normal(7r + a, as) 

^ V vl + cos"'fi:/ ^ V vI + cos^'k/ 

(8.38) 

However, now there is only approximately | (l — |) = | probability to have k G [O, |). 
These values of two-thirds for Pa, two-thirds for and one-third for P^° can be arbitrarily 
closely approached by taking e, ai, and as sufficiently small. By theorem 8.3.2, this example 
can approach arbitrarily closely to the boundary of allowed values of (Pa, P^, Pa°) fh® 
paradox. 


Remark The results of the preceding example also hold for the commonly employed 
Hadamard coin ^ J • Then the matrix U is the adjoint of a CMV-matrix with 

all Verblunsky coefficients with even index equal to For the Hadamard coin, we take 


initial wavefunctions to be V’ = | (C + y’) 


-1 


and ^ = f(C - (f) 


i 

-1 


, so we have 




i 

-1 


, where 


0 = 


TT 

(8.39) 

TT 

(8.40) 


for a, e, ai, and as as in the given example. The coin utilized in the example was chosen to 


agree with that used by Machida 48 49 
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Appendix A 

General propositions 


A.l Banach space propositions 

Notation Let A, B,. .. denote Banach spaces. 

Proposition Al.l Maps in 13(A, B) are continuous in the weak topologies on A and B. 

Proof Take any such map L. For any weak neighborhood 

A/'(La;0i,... = {b G B| - La)\ < e for j e n}} (A.l) 

with a G A, 01,..., G B*, and £ > 0 we have 

L(7V(a;L>i,...,L>„;£)) = A/'(La;0i,...,0„;£) □ (A.2) 

Corollary A1.2 A map in L G i3(A, B) is uniquely determined by its values on a weakly 
dense subset. 


Comment Starting with a weakly dense subset that is a vector space, its norm closure 
is necessarily a linear subspace by the properties of Cauchy sequences. Since this subspace 
is convex, by the separating theorem [^, it cannot be weakly dense unless it is the entire 
space; hence, a weakly dense subset is necessarily also norm-dense. This argument is not 
entirely satisfactory since the separating theorem uses the Hahn-Banach theorem 87 , which 
depends on the axiom of choice 88 . This dehciency will be rectihed for the situation of 


interest by proposition B1.3 below. 

The following important proposition is assigned as an exercise in 


125 


Proposition A1.3 Let A and B be Banach spaces. A subset V of A that is a vector space 
uniquely determines a map L G B{A, B) if V is dense in the norm topology on A and the 
operator norm of L restricted to V is bounded. The operator norm of L shares the bound to 
the operator norm of its restriction. 
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Proof Since a continuous function is uniquely determined by its values on a weakly dense 
subset, and since the norm topology is finer than the weak topology, if L exists, it is unique 
by A1.2. For any a G A, define La as the limit of the Cauchy sequence (Laj)^^, where 
(aj)^i is any Cauchy sequence converging to a composed of elements of V. The choice of 
Cauchy sequence does not matter because, given another Cauchy sequence (bj)^^ converging 
to a composed of elements of V, then {aj — is a Cauchy sequence converging to zero, 

so {L{aj — bj))^^ is a Cauchy sequence converging to zero. Since the product of a Cauchy 
sequence with a scalar is a Cauchy sequence and the sum of two Cauchy sequences is a 
Cauchy sequence, L is linear. □ 

A.2 Hilbert space propositions 

Notation Let H be a Hilbert space. 

Proposition A2.1 Convergence of a countable sum of disjoint {PjPk = 0 if j ^ k), 
orthogonal projectors, '^jPj, to an orthogonal projector P is equivalent in the following 
four topologies on H(H): ultrastrong-operator, strong-operator, ultraweak-operatoi[^ and 
weak-operator. 

Proof The weak-operator topology is courser than the other three, so convergence in any 
of the others implies convergence in it. Suppose convergence occurs in the weak-operator 
topology, so for any ' 05 ^ H, 


lim 

k^oo 


p-Y,pA 


i=i 


0 


Then, for any fixed k, 




i=i 


= lim 

m^oo 




i=i 


i=i 


ii=i 


(A.3) 


(A.4) 


Since -0,-0' were arbitrary, PY!j=iPj = Y!j=iPj- Similarly, P = Yl’j=iPj- 

Therefore, ^F — Yl’j=iPj^ = P ~ Yl^=iPj^ so F — Yl’j=iPj is itself an orthogonal pro¬ 
jector (it is clearly self-adjoint). Then, by taking -0 = ip', this implies 


lim 

k^oo 


f-Vf, 0 




= 0 


(A.5) 


^Same as the weak* topology. 







APPENDIX A. GENERAL PROPOSITIONS 


99 


so convergence necessarily also occurs in the strong-operator topology. Since the sequence 

/ k \ ^ 

of operators \P — ^~=iPj) is bounded in operator norm (all being projectors), the 

\ / k=l 

convergence necessarily occurs in the ultrastrong-operator topology as well. However, the 
ultrastrong-operator topology is hner than the others, so convergence in it implies conver¬ 
gence in the other three. □ 


A.3 Transition function propositions 


Notation For the following, sets will be denoted X, y,... and a-algebras by X ,.... The 
spaces of hnite, signed-measures on the given set with the given a-algebra will be denoted 
AA{X]£), AA{y]IF)y ... These are Banach spaces using the total variation norm. Note-by 
convention product measures are written using x although they are actually tensor products 
and should be written using 


see 


111 for a use of the latter notation). 


Following 69 , we have the following: 


Definition A3.1 For a-algebras £ on X and on 3^, a function r(-|-) • ^ is a 

transition function if: (i) for each x E X, r(-|x) G AAiy^IF)] and (ii) for each B E X, 
is a bounded, £^-measurable function on X. 

If t(-|-) is positive and has the additional property that r(3^|-) = lx-, then it is termed a 
transition probability function. The transition functions for specihed {£, X, 3^) clearly 
form a vector space. They form a Banach space using the norm 

||r(-|-)|| = sup II r(- lx) II total variation (A.6) 

x&X 

A transition function t(-|-) with specihed data {£, X, y) induces a linear map L E 
B{M{X-£),M{y]B)) via 

{Lfi){B) = f T{B\x)dfr{x) (A.7) 

Jx&X 

Not every bounded linear map is induced by a transition function; in general, a pseudo¬ 
transition ‘function” is required (see B2.7 and B2.8). However, we do have the following: 


Proposition A3.2 If L G B{M{X-£),M{y]B)) and K E B(M(y; B), M(£; Q)) are 
both induced by transition functions, then the composition K o L is also induced by a 
transition function. 


Proof Let L and K be such maps, with associated transition functions t(-|-) and 
Dehne a;(-|-) by 


= / i^{C\y)d{T{-\x)){y) 

Jy&y 


uj{C\x) 


(A.8) 
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for C G ^ and x E X. Then ti;(-|x) G A4{Z;Q). To see that oj{C\-) is T-measurable, note 
that, since i'{C\-) is J^-measurable and bounded, there is a sequence of simple functions 
converging uniformly to it. By the Dominated Convergence theorem 

then have 

u:{C\x) = lim hjkT{Bjk\x) 

j^OO ^ ^ 
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, we 


(A.9) 


By 89 , hnite sums of measurable functions are measurable and the pointwise limit of a 


sequence of measurable functions is measurable, so a;(C'|-) is measurable. Hence, a;(-|-) is a 
transition function. 

Now take any /i G A4(X;S) and C G Then, using the preceding results. 


((K o L)(/x))(C) = / iy(Cjy)d(L/^)(y) = lim '^hjk{Ly){Bjk) 


'yey 


(A.IO) 


lim N bjk / T{Bjk\x) dyL{x) = / u:{C\x) dy{x) 

u Jx&X JxdX 


Hence, 77 o L is induced by the transition function cj(-|-). □ 

Proposition A3.3 If L G B{M{X-£),M{Z-g)) and K G are 

both induced by transition functions, then the tensor product map 77 0 L is well-dehned 
and also induced by a transition function. 


Proof Let L and K be such maps, with associated transition functions t(-|-) and z^(-|-). 
Dehne by uj{-\x, y) = t(-|x) x i'{-\y) for each x E X and y E y. Then a;(-|x, y) is clearly 
in 7W(A X 3^^ .2^^, wliGFG 2^ is tli6 (T-cLlgcbrcL gGiiGFBjtGci by tliG TGctcLiigiilcLr subsets 7^ — ^ X 
The rectangular subsets form a semialgebrej^ Then the hnite union of rectangular sets is an 
algebra. 

Following Hausdorff 26 , let Rq be the rectangular subsets 77. For each ordinal a, let 
Ra be the collection of subsets of A x 3^ that are the countable intersection of subsets 
from the various collections Ry for ordinals /7 < a if a is even and that are the countable 
union of subsets from the various collections Gy for ordinals /7 < a if a is odd (where all 
limit ordinals-those without a predecessor-taken even). For clariheation, using the standard 
notation [^, Ri = IZ^, 7?2 = TZaS, and so on for the hnite ordinals. 

Fix any ordinal a and suppose that for all subsets B in all collections Ry for /3 < a we 
have the following property: there is an I-measurable Ci;(77|-) such that 


{{LzK)y){B)= u{B\x,y)dy{x,y) 

J{x,y)£Xxy 


(A.ll) 


collection of sets is a semialgebra if it is closed under intersection and the complement of any set is 
a finite union of sets in the collection. 
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for any X-measure fi on A x y. Take any C G Ra- If « is odd, we have a sequence {Bj) 
of subsets from the various R^ with (5 < a such that C = \J-Bj. Let oj{C\-) = \J ■u{Bj\-). 
Then oj{C\-) is X-measurable (see 1^) and we can extend L ® K hj 


{{L®K)^^){C) = 


'(x,y)GA:xy 


u(Cjx,i/) dn{x,y) 


(A.12) 


for any X-measure /i on A x hence, since C was arbitrary, Ra has the property. Similarly, 
if a is even, we have a sequence {Bj) of subsets from the various Rjs with (3 < a such that 
C =J\jBj. Let oj{C\-) = /\-uj{Bj\-). Then oj{C\-) is measurable and we can extend L® K 
by ( |A.12 ); hence, Ra also has the property. 

However, Rq has the property-for any X-measure /r, (X 0 K)y, is uniquely determined on 
the rectangular subsets by 


{{L®K)yi){A xB) = 


'{x,y)exxy 


T(A\x)v(B\y) dii(x, y) 


(A.13) 


(see [^)-so by 
Following Kuratowski 


transhnite induc- 


43 , the (T-algebra X 


and oj{A x B\x^y) = T{A\x)h'{B\y) is X-measurable 
tion [116 all the Ra have the property, 
is given by the union Therefore, Ci;(-|-) is a transition function and induces a well- 

dehned tensor product map K ® L. 


Comment The preceding proof does not depend on the axiom of choice since the union 
only needs to be taken up to the ordinal number for the minimal uncountable well-ordered 
set 


43 , whose existence does not depend on the axiom of choice 55 


Lemma A3.4 For any measure space (A’,X,/i), any “function” / G X°(A;/x) contains 
some X-measurable function. 


Proof Take any function g E f. For each x E A where g{x) E {±oo, undehned}, redehne 
g so g(x) = 0. The new g is still in /. Now construct the sequence of p-measurable simple 
functions (gk) converging pointwise to g by 


go — 0,gi — l{g>i} — 


(A. 14) 


and for fc G {1, 2 ...}, 


9k+l 


= 2^-1 


{g>2A 


4 k_i 

E 

m=l 


m 


2k 


- 2^ - 


-1 

E 

m=—A^+l 


m 


m—l ^1 


2 ^ ■ 1 


{g<-2A 


(A.15) 


By the dehnition of the completion Eq of £ (^, for each Xo-measurable subset B, there is 
some D E £ such that D B and /i(X)) = /x(~ H); hence C =~ D is in X, is contained 
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within B, and has fi{C) = Using this, for fc G {0,1, 2...} there are snbsets {C'^} G S 

snch that: 

C {(? > 2^} and /i = 0 for = {^ > 2^} \ (A.16) 

C > ^ > and /i = 0 

for ^ ^ P} \ G {1, 2,..., 4^ - 1} (A. 17) 

< ^ < -^1 and /i = 0 

for < ^ < _p| \ ^2+'), m G {-1, -2,..., -A^ + l} (A.18) 

C{g< -2'^} and fi = 0 for = {g < -2>^} \ (A.19) 

Hence, there is a sequence of £^-nieasurable simple functions (hk), where 

4fc 

\^ /A \ 

ho = 0, hi = 1^(1) — 1^(0, hfc+i = 2_^ ^ ^ci^+o (A.20) 

m=—4^,7777^0 


Then (hk) converges pointwise to a T-measurable function h 89 . The function h differs from 


g on at most the subset uu ^ Dm\ which is the countable union of /i-null subsets; hence, 
itself yU-null. Therefore, h is necessarily in /. □ 

For the following, {X,S,fj.) is a measure space with /i(A) = 1 and is a cx-algebra on y. 


Definition 3.5 A conditional probability z^(-|-) is a L^(A; /x)■''-valued, vector J^-measure on 
y with the property that ^(d^l-) = [lx]. 


Definition 3.6 A transition probability function t(-|-) is a lift of a conditional probability 
i/(-|-) if t{B\-) G i^(-B|-) for every subset B e X. 


Proposition A3.7 If either: (i) IF is generated by countably many atoms; [ii) y is 
an uncountable, complete, separable, metric space with IF the Borel cx-algebra; or {iii) 
£ is generated by countably many atoms-then for any marginal probability /i on A and 
conditional probability n(-|-), there is some (generally non-unique) lift x(-|-). 


Proof—case(i) For each atom A G A, choose a particular, T-measurable function h{A\-) G 
z>(A|-) (these exist by A3.4). The function h{A\-) can only be less than zero on a set of 
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yU-measure zero; hence, the set Bi where any of these fnnctions is less than zero is still of 
yU-measure zero. Furthermore, since the union of all atoms is A, we have 

5^ v{A[) e v{X\-) = M (A.21) 

atoms AGlT 


where the sum on the left-hand side converges pointwise; hence, the function given by the 
left-hand side has value one almost everywhere with respect to fi. Let B 2 be the set with p- 
measure zero where it is not equal to one. Let be a set in £ with yu-measure zero containing 
Bi U B 2 (such a set necessarily exists by the dehnition of yu-measure zero [^). Then, dehne 
r(-l-) by 


r[B\x) 


^^atoms ,A(ZB HA\x) ifx^B 

dfi il X E B 


(A. 22 ) 


for any B E IF and x E A. It is readily seen that r(-|-) is a lift of zz(-|-). 


Case(n) The obvious approach is to use the separable and metric properties to form 
countable, locally hnite open covers of decreasing radii using the paracompact property of 
[S^-from these a sequence of rehning partitions can be formed. The 


91 


metric spaces 

cr-algebra generated by these partitions is the Borel a-algebra; one may then hope that a 
procedure similar to the preceding with a choice for each element of each partition would 
give the lift t(-|-). However, while it easily seen this approach succeeds if a lift exists (and 
we will show one does below), it cannot be used to prove the existence of a lift; the problem 
is that to employ the Extension theorem 92 pointwise to get a measure, unlike the situation 


above, there are an uncountable number of conditions that must be met. While each fails on 
a set of measure zero, it is not clear that taken altogether the set where any one of them fails 
is still only of measure zero. (The existence of a lift would give this, but that is what we are 
trying to prove!) Instead, we will proceed in a different direction. The plan is to add in an 
auxiliary measure space, namely [0,1] with the standard topology, the Borel cr-algebra, and 
Lebesgue measure. Then we will construct a deterministic transition probability functiorj^ 
(see A3.9 for the connection to another common concept of determinism); integrating over 
the auxiliary space will then provide the desired lift. 

Start with the result due to Kuratowski that y is Borel equivalent]^ to the interval [0,1] 
with the standard topology 93 ; let -0 ^ [O 5 1] —)■ 3^ be the corresponding bijection. For A 


Lebesgue measure on [0,1], let the cT-idea|^A/' be the ywx A-null subsets of cr (£! x Borel([0,1])). 
For each a E [0,1], let i>('0([O, cr))|-) be a particular £!-measurable (not merely p-measurable) 
function from i/('0([O, (a))|-) (these exist by A3.4). Dehne the equivalence class A^ of subsets 


^One taking only the values zero and one. 

“^Two topological spaces X and y are Borel equivalent if there is a bijection ijj : X ^ y such that both 
b and '0“^ take Borel sets to Borel sets (so both 0“^ and 0 are Borel measurable). 

a-ideal is a subcollection of a cr-algebra that contains all subsets of member sets within the tr-algebra 
and is closed under countable unions. 
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oiAx [0,1] by 


IJ {x|z>(t/>([0,a))|x) > s} X {s} 

= 

IJ {a^|i>(V’([0,a))|a:) > g} X [0,g] 

/6[0,1] 


_geQx[0,l] 


where the equivalence class is taken over sets that differ by subsets in the cr-ideal Af. Clearly 
this is independent of the choice of z/('^([0, a))|-)- Note Aa = [df] for a > 1 and A^ = Af for 
a < 0. 

Then the collection {Aa} is a somej^By [^, there is a cr (T x Borel([0, l]))-nieasurable 
function <^9 : df x [0,1] — )■ [0,1] for which the somej^of [ip] is Aa- (Note ip is not unique; any 
particular function in [ip] will do.) 

Now dehne r('0([O, a))|-) by 

r(V’([0, a))|a;) = essential sup {s G [0, l]|(p(a:, s) G [0,a)} (A.24) 


where the essential supremum is taken over equivalence classes of sets differing by Lebesgue- 
null sets. By Tonelli’s theorem 96 , r(V’([0, a))|-) is an T-measurable function. Furthermore, 
it is it z/('0([O, a))|-). To see this, note that by Tonelli’s theorem, for almost every x E A 
with respect to /i, the sets 


{s G [0, l]|z>('0([O, a)) lx) > s} and {s G [0, l]|</5(x, s) G [0, a)} 


(A.25) 


differ by only a Lebesgue measure zero subset. Taking the essential supremum over each set, 
we have that z>('^([0, q;))|-) = r('^([0, a))|-) almost everywhere with respect to pi. 

More generally, for any B G Borel(3^), dehne t{B\-) by 

t{B\x) = a ({s G [0,1 ]\^jJ o ip(x, s) E B}) (A.26) 

Since the composition of measurable functions is measurable (readily seen by the dehnition 
of measurability), then, by Tonelli’s theorem, t{B\-) is T-measurable. For each x E A, 
r(-|x) is a Borel measure on 3^ with r(3^|x) = 1. Finally, since the collection {[0,a)} over 
a G M generates the Borel cr-algebra on [0,1], the collection {'^([0,a))} generates the Borel 
cr-algebra on 3^, so t{B\-) E i^{B\-) for any B E Borel(3^). 


Case(m) For each atom A E S with fi{A) > 0, u{B\x) is well-dehned for any x E A and 
B E J-, so t{B\x) is necessarily i>{B\x). On the remaining atoms, 'r(-|-) can be set rather 
arbitrarily; take it to be djji for concreteness. □ 

Examining the proof of the preceding theorem, we have the following interesting result: 

^Following [^, for tr-algebra £ with tr-ideal given by null sets Af, the collection of equivalence classes 
indexed by reals {Aa} of £/Af is a soma if: (i) Aa < Ap for a < and (ii) Aa = Vj Ap- for any countable, 
increasing sequence {(3j) converging to a. 

^The soma of the equivalence class [/] of a real valued function / is the collection of equivalence classes 
of preimages, [/“^((—oo, a))], over a G K. 
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Proposition A3.8 If either: (i) IF is generated by countably many atoms; or [ii) 3^ is 
an uncountable, complete, separable, metric space with IF the Borel cr-algebra-then for any 
transition probability function r(-|-) \ IF x X —)■ [0,1] there is a deterministic transition 
probability function x A x [0,1] —)■ {0,1} such that 

t{B\x)= [ ^{B\x,s) dX{s) 

Js£[0,l] 

where A is Lebesgue measure on M. 


Proof First take case (i). By the dehnition of being countable, there is an ordering of 
the atoms {Aj} for j G {1,2,...} (with the indexing set truncating to j G (1, 2,..., n} if 
there are only hnitely many atoms). For m G (1,2,...} (or m G (1, 2,..., n}), dehne Bm by 
Aj and C A x [0,1] by 

IJ {a;|r(5^|a;) > s} X {s} = (J {a;|r(5^|a;) > g} x [0, g] (A.27) 

se[o,i] <?eQx[o,i] 

Then {Cm) is an increasing, nested sequence of subsets in x [0,1]) such that ~ 

A X [0,1]. 

Dehne the function 99 : A x [0,1] —)■ lA by 


s) = Y^ 1x\c, 
j 


Dehne ^(-l-) by 


e(5|a;,5) 


1 if A(p^x,s) T B 
0 otherwise 


for any i? G A, a; G A, and s G [0,1]. Dehne r'(-|-) by 


t'{B-x)= f ^{B\x, s) d\{s) 
Js&[0,l] 


(A.28) 


(A.29) 


(A.30) 


for any B & IF and x E X. It is readily seen that 'C('I') is a deterministic transition probability 
function and t'(-|-) is a transition probability function. Furthermore, for any m G (1, 2,...} 
(or m G (1,2, ...,n}). 


T'{Bm\x)=[ ^(5^|a;,s) dA(s) = A (C,„ n {x} X [0,1]) = r(5m|a;) (A.31) 

ase[o,i] 

Since the collection {Bm} generates A, t(-|-) = t'(-|-). 

Now take case {ii). As with the proof of the preceding theorem, start with the proposition 
that y is Borel equivalent to the interval [0,1] with the standard topology; let ^:[0,1]^3^ 
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be the corresponding bijection. Define the x Borel([0, l]))-nieasurable subsets A^ of 
A” X [0,1] by 


= IJ {a;|r('0([O,Q!))|x) > s} X {s} (A.32) 

se[o,i] 

= IJ {x|r(^/>([0,a))|a:) > g} X [0,g] (A.33) 

<?eQx[o,i] 


Note An = A for a > 1 and = 0 for a < 0. 


Then, following 97 the collection of ordinate sets {A^} defines a a(S x Borel([0,1]))- 
measurable function g? : A x [0,1] ^[0,1] by 


(p{x,s) = inf{a G M|(a;, s) G A^} 


Define by 


e(i?|x,s) 


1 if o ip(^x, s) E B 
0 otherwise 


for any B E X, x E A, and s G [0,1]. Define by 


(A.34) 


(A.35) 


t'{B-x)= f ^{B\x, s) dX{s) (A.36) 

ise[o,i] 

for any B E IF and x G A. It is readily seen that is a deterministic transition probability 
function and 'r'(-|-) is a transition probability function. Furthermore, for any a G M, 


r'(?/>([0,a))|x) = [ ^(^/>([0,a))|x,s) dA(s) 

^se[o,i] 

= A (A„ n {x} X [0,1]) = r('0([O,Q!))|x) (A.37) 

Finally, since the collection {[0,a)} over a G M generates the Borel cx-algebra on [0,1], the 
collection {'0([O,a))} generates the Borel cx-algebra on 3^, so r(-|-) = x(-|-)- D 


Comment Note the preceding can be readily extended to cover cases where /r is the zero 
measure apart from a sub-a-algebra of S generated by countably many atoms or the marginal 
probability on y formed from z/(-|-) and fi is the zero measure apart from a sub-a-algebra of 
A generated by countably many atoms. 


Proposition A3.9 If either: (i) B is generated by countably many atoms; or {ii) y is 
an uncountable, complete, separable, metric space with B the Borel ex-algebra-then any 
deterministic transition probability function r(-|-) : A x A —)• {0,1} is induced by some 
measurable point transformation / : A —3^, t{B\x) = (1^ o /)(x). 
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Proof First take case (z). For each x G A’, take f{x) to be any value in the atom A & A 
with the property t{A\x) = 1 riy \ A\x) = 0 . 

Now take case (m). As with the preceding proofs, start with the proposition that y is 
Borel equivalent to the interval [0,1] with the standard topology; let -0 : [0,1] —)■ 3^ be the 
corresponding bijection. For each x E X, take /(x) to be dehned by 

/(x) = 0(sup{a e [0, l]|r(0([O,a))|x) = 0}) (A.38) 

Then/is measurable since {(a, 1] I a G M} generate the Borel subsets of [0,1],so {0((a,l])|a G 
M} generate the Borel subsets of 3^, and 

/■^ (0((q;,1])) = {xl^"^ o/(x) > a} = {x|r(0((a,l])|x) = 1} G T □ (A.39) 

Proposition A3.10 If the marginal probability /i on A and conditional probability z^(-|-) 
are such that the resulting joint probability is absolutely continuous with respect to the 
product measure /i x p for some A-measure r] on 3 ^, then there is some (generally non¬ 
unique) lift 'r(-|-). 

Proof The joint probability measure p resulting from p and z^(-|-) is 

p{A xB)= [ u{B\-);dp (A.40) 

Ja 

for any rectangular subset A x B E £ x B. By the Radon-Nikodym theorem [7^, there 

dp —* 

is a “function” g E L^(X x y; u x p), the Radon-Nikodym derivative g = — -r, such 

d{p X p) 

that p = g p X p. Choose any particular a{S x A)-measurable function g E g (such a 
function necessarily exists by A3.4). The function g is less than zero on a subset of p x re¬ 
measure zero; by changing g to be zero there, the new g is still in g. By Tonelli’s theorem, 
/(i?|-) = Jy^^g{-,y) dpipy) is T-measurable. Take r(R|x) to be /(-B|x) for all x where 
^(A|x) = 1 ; on the T-measurable,/i-null subset where ^(A|-) 7 ^ 1 , take r(i?|x) tobep(Axi?). 
□ 

Comment Instead of the approach of A3.7, one may attempt a simpler approach using 
the notion of liftings from L°°(A; p) to T-measurable functions on X. 

Definition A3.11 A map from L : L°°{X]p) —>■ /i-measurable functions on A is a linear 
lifting if it has the following properties: (i) L is positive, linear, and continuous (bounded); 
(a) Lf E f for any / G L°°(A;/i); and (Hi) L[lx] = lx- 

Definition A3.12 A linear lifting is a lifting if it also respects the multiplicative structur^ 
of L“(A;/i): (iv) K{fg) = Kf Kg for any f,g E L°°{X;p). 

®Then the lifting K necessarily takes “functions” containing characteristic sets to characteristic sets, 
which reduces the problem of the existence of liftings to that of the existence of set liftings. 


120 121 
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Definition A3.13 A lifting is an S-lifting if it maps into £^-measurable functions (rather 
than simply /i-measurable ones). 


The existence of £^-hftings is in general an open question, even in simple casesj^ Note the 

120 does give the existence of a lifting K : jj) —)■ £^o-measurable 


Lifting theorem 

functions on X. However, the completed cr-algebra depends on the marginal probability 
p, so this is not what we desire. Furthermore, the proof of the Lifting theorem relies on the 
Axiom of Choice and is nonconstructive 70 , so it is of little practical use. In addition, as 


the following example demonstrates, this approach cannot work in general: 


Counter-example A3.14 Let A = 3^ = [0,1] with the usual topology. Let £ = IF he the 
resulting Borel a-algebra. Let A be Lebesgue measure on [0,1] and the joint probability p 
be the diagonal measure 

p{E) = \{{xe[hO]\{x,x)eE}) (A.41) 

Then the marginal probability p = X and the conditional probability z^(i?|-) = [1^]. There 
is an obvious lift given by = 1 b- However, this cannot be written as a lifting applied 

to else, since ^ t{B'\-) for B ^ B' yet B and B' differ by a A-null subset, so 

[1 b] = [Is'j; we have the contradiction 

1b = r{B\-) = K [1b] = K [Ib^] = t{B'\-) = U' 

We now claim that there is no lift t(-|-) of which can be written as t{B\-) = Kv{B\-) 
for all Borel subsets B for a lifting map K. 


Proof Suppose there were such a lift coming from a lifting map K. Since liftings take 
characteristic functions to characteristic functions, K induces a set lifting k by A'[1 b] = 
Ic ^ C = kB. Let Aq = [0,1]. Then kAq differs from Aq by a A-null set. (Actually, 
k[ 0, 1] = [0,1], but we are seeking to establish a general pattern.) By definition, there is an 
open set Bq that covers Aq \ kAq with \{Bq) < | (one-quarter can be replaced by anything 


strictly between one-half and zero). Now, iteratively, define Aj 


rrij rrij+l 


to be either 


the subset 


rrij—i 2 mj_i+l 

or 

2 mj_iH-l 

rrij-l+l 

2 J-1 ’ 

23 ’ 

to 

1 


depending on which one has 






A {A, n Hfc) < - A (A,_1 n Bk 


(A.42) 


k=0 


k=0 


Once we have Aj, we take Bj to be an open set that covers Aj \ nAj with X{Bj) < then 
continue to 

Since [0,1] is compact and each Aj is closed, the decreasing nested sequence of subsets 
{Aj) has nonempty intersection by the finite intersection property 98 57 . Since the diam¬ 
eters of the Aj tend to zero, this intersection is a single point-call it xq. Then xq is also 


^For instance, X = [0,1], Q being the Borel tr-algebra, and g being Lebesgue measure. 


121 
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in the intersection nAj. Suppose otherwise, so it is not in some nAj] hence, it is in Bj. 
Since Bj is open, then there is some ball, (xq — s^Xq + e), also in Bj. Then, for k sufficiently 
large so e > 2“^, Ak C Bj. However, 

( k \ k 

m=0 / m=0 

m=0 

Continuing, we eventually arrive at 

A I Hfe \ IJ j > ^ 2fc-m4m+i ^ 2 ' ^ ^ (A.44) 

which is a contradiction to Ak C Bj. Hence, Xq G {^-nAj. Note, by the positivity of K, 
{nAj) is also a decreasing nested sequence (with nAj C Aj) and xq is the only point in the 
intersection. 

However, this leads to a contradiction. For each j, let Ej = Aj \ {xq}, so [Jj Ej = 0. 
Then, since Aj rsj rsj KAj, pointwise 

lim Ku{Ej \ ■) = lim K [ip] = lim (A.45) 


which is zero except at Xq, where it has the value one. For each j, let Fj = Ej \ Ej^i, so 
{Fj} is a disjoint collection. Then, pointwise, by the countable additivity of measures, 

^av(Fj|.) = av([Jf, 

j V j 

However, for any m, 


m 

= J2Kiy{F^\-) + Ku{E^\-) (A.47) 

i=i 

Taking the limit as m —)■ cx) gives the right-hand side as 

^ Kiy{Fj \■) lim Kv {Em\■) = l[o,i] + l{a;o} (A.48) 

j 

which is a contradiction. Hence, there is no lift r(-|-) arising from a lifting map K for the 
conditional probability zz(-|-). □ 


40 , 1 ] 


= Ku 


Ug 


= Ku 



^ — Fu ([0,1] \ {xq}! ■) — 1k[o,i] — l[o,i] (A.46) 
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Appendix B 

Propositions for option I 


B.l Measures 

Notation For the following, sets will be denoted X, y,. .. and a-algebras by S, X,. ... The 
spaces of hnite, signed-measnres on the given set with the given a-algebra will be denoted 
M.{X] £), Ai{y] X),.... These are Banach spaces nsing the total variation norm. For a given 
measnre space let Sq be the cx-algebra given by the completion of £ with respect 

to yU (see (^). The extension of /i to £o will still be denoted /i. Then we will say /r is a 
y -measure if is a snb-cr-algebra of £q. An element of To will be termed ^-measurable. 

Definition Bl.l A snbset of signed-measures, M C A1(A’;T), is absolutely-eontinuous- 
eomplete if, for any /x in M, all signed-measures in mathcalM{X] £) absolutely continuous 
with respect to |/i| are also in M. 

Proposition B1.2 Any absolutely-continuous-complete subset that is a vector space is 
directed-complet^ 

Proof Let M C M(X;£) be such a subset. Given any upward-directed subset B C M, 
bounded by above by some p G M, define V/iSS E G £, 

( V ^ 1 = sup/i(E) (B.l) 

\,Ib J 

This exists since it is bounded from above by p{E). It is readily seen that V^es is greater 
than or equal to (using the partial ordering) any p ^ B and that, given any other z/ G M 


partially-ordered subset is upward-directed if, given any two elements, there is a third that is greater 
than or equal to both. A set is directed-complete if any bounded, upward-directed subset has a supremum. 
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with that property, \J Given any countable collection of disjoint, measurable sets, 
C by the upward-directed property and the countable additivity of each /x G -B, 


j —1 J J \j —n+1 / \fi^B J 


' oo \ 00 / \ 

U 2 E V (b-2) 

J=1 / i=l \^l&B J 


for any n G {1, 2,...}. Since z/ < V^es < P for any u E B, and both p and n are countably 

additive, both (V^en/^) (U“n+i ^i) ^ 0 and the tail sum Y.T=n+i iyi^j) ^ 0 as 
n —)■ cx), so 

E (V^ (V f*) (^ E (V '“'j (B-3) 

j=l \ii&B / \ti(^B / \j=l / j=l \ii&B / 

Therefore, V/^eB T countably additive, so it is a signed T-measure. Its total variation 

norm is bounded by the total variation norm of p V (—n) for any particular u E B. Since 

V < V^es/^ — d) y ij.eB T is absolutely continuous with respect to \p\ + |z/|, so p E M. 
□ 


Comment The following proposition corrects the defect mentioned after A1.2 for the 
particular case required. 

For the following two propositions, let Q be the a-algebra generatec^ by the rectangular 
subsets S X IF, denoted Q = a(£ x IF). Let the absolutely-continuous-complete subsets 
M C A4(A;£), N C Ai(y-,IF), and Q C M(A x y;Q) also be vector spaces. 


Proposition B1.3 If M 0 iV is weakly dense in Q, then it is norm-dense in Q. 


Proof Suppose there were some measure p G not in the norm-closure of M 0 iV, 
so there is some e > 0 such that \\p — z^|| total variation > £ for all n G M 0 N. By Hahn- 
decomposition 100| and the absolutely-continuous-completeness property of M and N, if 
V E M ® N, then \v\ E M ® N. Let p T \v\ be the singular part of p with respect to \v\ 


using Lebesgue decomposition 76 . The set of ^-measures {/i T |z/||n G M 0 A} is lower- 
bound by the zero-measure. It is downward-directed since p T (|z/| -|- |z/'|) is less than or 
equal to (in the partial ordering) both p T \v\ and p T \v'\ for any E M ® N. Hence, 
by the preceding proposition, the ^-measure r = I\i,^m®n T -L \^\ exists. Furthermore, from 
its dehnition in the preceding proof, this measure has total-variation norm greater than or 
equal to £. Let p|| r be the absolutely continuous part of p with respect to r using Lebesgue 
decomposition. Now consider the bounded, linear functional ^ E x y;S)* given by 

<hp = {p II t){A X 3^). This is zero on M 0 A, yet <F/i = ||r||totai variation > hence, p is not 
in the weak-closure of M 0 iV. □ 


^For any collection A of subsets of X, the cr-algebra generated by A, (j{A), is the smallest cr-algebra of 
subsets of X containing A. 
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Proposition B1.4 If M 0 is norm-dense in Q, then M~^ 0 ®+ N~^ is norm-dense in Q^. 


Proof By Hahn-decomposition and the absolutely-continuous-completeness property of M 
and N, the measures in M 0 iV are total-variation norm-dense for the measures Q~^. Given 
any measure fi = ® D ^ M ® N^ it is absolutely continuous with respect to the 

product measure p = 0 kfel), which is in M 0 by Hahn-decomposition 

and the absolutely-continuous-completeness property of M and N. By the Radon-Nikodym 
theorem 72 , there is a positive “function” ^ G L}{X x 3^;p) such that p = ^p. By 

proposition B3.1 below, ^ can be arbitrarily well approximated in total-variation norm by 
elements of kil) '^R+ (3^5 Y.k ^ 


Proposition B1.5 The total-variation norm is a cross-norm for product measures. 


Proof Let p G A4(A;T) and z/ G A4(y;X). By Hahn decomposition, \p x iy\ = \p\ x |z/|. 


so 


\\p X v\\ = \px v\{x xy) = \p\{x)\v\{y) = \\p\\\\u\ 


□ 


(B.4) 


Proposition B1.6 If /i is a cr-hnite measure on X, then there is no uncountable collection 
{Aa} of subsets of X with the properties that p{Aa) > 0 for all a and p{Aa fl A^) = 0 for 
all a (3. 

Proof Suppose otherwise. Since p is a-hnite, there is a countable, disjoint collection {Bj} 
of subsets of X such that each Bj has hnite /i-measure and their union OjBj is the entire 
space X. Now suppose there is some £ > 0 and j such that for inhnitely many of the {Aa}, 
p{Bj n Aa) > e. That immediately contradicts p{Bj) being hnite. Therefore, it must be 
the case that for any e > 0 and j, only hnitely many of the [Aa] satisfy p{Bj fl Aa) > s. 
Let {Ak} be the countable collection formed by taking the union over all j and over all 
e G {1,2“^, 2“^,...} of such {Aa}- However, this also leads to a contradiction, since for any 
Ai3 ^ [Ak], piAp n Bj) = 0 for all j, yet 0 < p{Af}) = h (^/3 G Bj) = 0. □ 

Proposition B1.7 If a a-hnite measure is inhnitely-divisibl^ then it has no atoms-for- 

the-measur43 

3A cr-finite measure for a measure space {XT, A) is infinitely-divisible if for any e > 0, there is a 
countable (finite if fj- is finite) partition of X, {Bj}, with each fJ-Tj) < 

^For a measure space {X,£,fi), a subset A G £ is an atom-for-the-measure ji if for any B G £ with 
B G A, either ^(B) = 0 or /r(T \ B) = 0. 
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Proof Consider the following property: a measnre cr-finite /i for a measnre space {A, S, /i) 
has the once-divisible property if for any snbset A ^ S with hnite fi{A) > 0, there are 
disjoint B,C E £ with B UC = A snch that both /i(-B) and /r(C') are less than ^fi{A). This 
property is readily seen to be eqnivalent to being inhnitely-divisible. Fnrthermore, if there is 
an atom-for-the-measnre, then the once-divisible property does not hold. Hence, it remains 
to show if the once-divisible property does not hold, there is necessarily an atom for the 
measnre. 

Snppose there is a snbset A E £ with hnite /i(H) > 0 snch that there are no disjoint 
B,C E £ with B U C = A snch that both /i(i?) and /i(C') are less than Let c be 

the inhnnm over b E l] snch that the division is possible for both fi{B) and /i(C') are 
less than b ■ fi{A). If c = 1, then A is an atom-for-the-measnre fi. If there is a division with 
p(H) = c- fi{A), then B is an atom-for-the-measnre fi. The only remaining possibility is that 
for every £ > 0, there is a division with /i(i?) < {c-\-e)^{A). By taking e E {2°, 2“^, 2“^,.. .} 
snccessively, we get a seqnence of snbsets (Bj) with c ■ fi{A) < fi{Bj) < (c -|- 2“-^)/i(H). For 
every j < k in c ■ fi{A) < jJi{Bj fl B^) < (c -|- 2 “*^)/r(H); hence, is in £ with 

h (n“ -qBO = c ■ /r(H)-a contradiction. Hence, this possibility cannot occnr. □ 


Comment Note that an atom-for-the-measnre need not be an atom for the cr-algebra. 
For instance, the atomic measnre 5{o} on M has all snbsets containing 0 as atoms-for-the- 
measnre. For separable, metric spaces with the Borel a-algebra, all atoms-for-the-measnre 
are indnced by atomic measnres in this manner. [M] However, there are other ways atoms- 
for-the-measnre can occnr. Consider the measnre on [0,1] assigning zero to all conntable 
snbsets and one to their complements. Then all the snbsets with conntable complement are 
at oms-for-t he-measnre. 

Take the measure /i hnite. Then by successive partitioning and an arguments similar to 
that used in the preceding proof, for any e > 0 it is possible, with only hnitely many steps, 
to hnd disjoint atoms-for-the-measnre {Aj} such that any atoms-for-the-measnre A in the 
restriction of /i to A” \ IJ^. Aj have fi{A) < e. By taking e G {2°, 2“^, 2“^,...} successively, 
we get a countable partition of subsets in £ 


^ j / j 


(B.5) 


where /i restricted to A\[J^- Aj is inhnitely-divisible and each Aj is an atom-for-the-measnre. 
This can clearly be extended to cr-hnite /r. 


Proposition B1.8 For the hnite, Borel measure p on [0,1] x [0,1] (with the standard 
topology) and linear map K of 


. 2 , Kp is the zero measure. 
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Proof Take any finite Borel measures /i, nw on [0,1] (with the standard topology). By the 
preceding comment, 

T = To+ Y1 ^ = ^0 + bj6{y.} (B. 6 ) 

j j 

where are inhnitely-divisible, {aj),{bj) are summable sequences of strictly positive 

reals, and {xj}i{yj} are countable subsets of [0,1]. Consider the partition of [0,1] x |0.1| 
given by AOBOC^ where B = (({xj} U {yj) x [0,1])U([0,1] x {{xj} U {yj)), C = {(x, x)\x G 
[0, l]}\i?, and A = [0,1] x [0, l]\{(a;,a;)|a; G [0, l]}\i?. Then p is clearly the zero measure on 

AOB and both p, x bj5{yp and x uq are clearly the zero measure on C. Take 

any e > 0. Since po is inhnitely-divisible, we have a hnite partition {Bj} of [0, l]\({a;j}U{|/j) 
where pQ{Bj) is less than e for every j. Then 

po X z/o(C) < ^/io X uo{Bj X Bj) = ^ po{Bj)vo{Bj) 
j j 

<e'^uo{Bj) = ez/o([0,1] \ {{xj} U (B.7) 

j 

Since e was arbitrary, it must be that po x z/q is the zero measure on C as well. Therefore, 
p -L px u. Since p,nu were arbitrary, Kp is the zero measure. □ 

Proposition B1.9 For a-hnite, £^-measures p,nu on X, If nu is absolutely convergent 
with respect to p and p is inhnitely-divisible, then z/ is inhnitely-divisible. 

Proof Suppose otherwise, so by B1.7 there is a subset A that is an atom-for-the-measure 
z/. Since p is inhnitely-divisible, it has the once-divisible property (see B1.7), so there are 
disjoint subsets Bi and Ci in £ with i?i U Ci = A such that both p{Bi) and p{Ci) are 
less than |/i(A). Take Ei the one of Bi,Ci that is an atom-for-the-measure z/. Repeat 
the process with Ei for A and iterate to get a nested, decreasing sequence of subsets in 
£, (Ej). Then p Ej^ = lira j p{Ej) = 0 whereas p Ej^ = lira j p{Ej) = p{A)-a 

contradiction to z/ being absolutely continuous. Hence, z/ has no atoms-for-the-measure and 
is inhnitely-divisible. □ 

B.2 Maps on subspaces of the space of measures 

Let M d AA{X]£) he any absolutely-continuous-complete, norm-closed subspace. 

Proposition B2.1 Any positive linear map L G B{M, AA{y] X)) has 

||L||op = sup Lp{y) (B. 8 ) 
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Proof Since L is positive, |L/i| = |L(/i+— /i )| = |L(/x+) — L(/x )| < + L(/i ) = L|/x| 

using Hahn decomposition, so 

sup Lfi{y) < \\L\\op = sup \Lfi\{y) < sup L^{y) □ (B.9) 

fi&M+, pl{X)<1 

Proposition B2.2 The space of maps B{M, M.{y-, X)) is a vector lattic^ 


Proof For any maps K,L e B{M, M{y; X)) dehne iF V L by (iF V L)(p,)(F) for G M+ 
and F E X being 


sup 


EE max {L^j){Fk)} 


j=i k=i 


n,m E {1, 2,...} 

hi) • • • )Tn ^ ^ 
disjoint Fi,, Fm E X 

E”., ft = A. ur.i i;- = 


(B.IO) 


Clearly, K y L > K and K y L > L. Also, for any J E B{M, Ai{y; X)) satisfying J > K 
and J > L, then J > K y L. Also, clearly (K V L){c ■) = c{K V L) for any real scalar 
c > 0. Since M is a Banach lattic^ it has the Riesz decomposition property 66 that for 
any p, z/ G M+, any pi,..., G such that J2'j=i Tj = P T ^ can be decomposed into 
hi = Pj + ^i with both pj, Vj E for each j and with X]i^=i Pj — P ^^cl YTj=i Also, 

given any partition of F into disjoint, J^-measurable subsets, F = U”L\ Gj = Hj, there 
is a rehnement of both partitions, F = Ufc^i Gj fl Hk. Therefore, {K V L)(p + i'){F) is 
equal to 


sup ^EE max {{Kpj){Fk), {Lpj){Fk)} 

j=i k=i 


E rii 

i=i Ek=i max 


= sup < 


' 2-^j=i 2-^k=i 


max 


iKp,){Gk), 

{Lpj){Gk) 

{Kiy^){Hk) 

{Lu,){Hk) 


= {KyL){p){F) + {KyL){u){F) 


n,m E {1, 2,...} 
hi) • • • ) pn ^ Af 
disjoint Fi,..., F^ E X 

E;=ihi = h+Fur=i^i = ^ 

ni,n2,mi,m2 E {1,2,...} 
Pli • ■ ■ 1 Pni ) 


Z/i, . . . , 

, E M+ 

disjoint Gi 

n 

EX, 

disjoint 

, . . . , ihrn,2 

E X 

E^i/h = 

h, e;:i 

= V 

ur=i = 

^ ur^i = 

= F 


(B.ll) 


partially-ordered, vector space A is a vector lattice if for any a, b G A, there are elements a V b and 
a A b such that a V b is greater than or equal to both a and b, but is less than or equal to any other element 
with that property and a A b is less than or equal to both a and b, but is greater than or equal to any other 
element with that property. 

®A positive cone is normal if 0 < a < b implies |ja|| < ||b||. A Banach lattice is a complete, normed 
vector lattice with a normal cone and with |||a||| = ||a||. 
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Since the cone of measures is generating for signed measures by Hahn decomposition and 
since M has the absolutely-continuous-complete property, K\/ L extends to a linear map on 
all of M. 

It remains to show the image of the map Ky L is indeed the signed measures. Since the 
cone of measures is generating, it suffices to show this with any fi G M~^. Take any countable 
collection of disjoint, measurable subsets C F. By the preceding argument using 

Riesz decomposition and rehnement of partitions, for any n E {1, 2,.. 

n / oo \ / oo \ oo 

^(A-VL)(,,)(F,) + (A-VL)M [J fA <(A'VL)(,,) MJf; <5;(AVL)M(A,) 

j=l \j=n+l / \j=l / j=l 

(B.12) 

However, Kfi < {K \/ L)ij, < \Kfi\ + |L/i| and both Kfi and \Kfi\ + |L/r| are countably 
additive, so both the tail of the series, {K V T)(/i) ^IJ^n+i 

go to zero as n —)■ cx). Therefore, {K y L)fj, is countably additive. 

Finally, K < K y L < \K\ + |L|, so iF V L is absolutely continuous with respect to 
\K\ + \L\. Hence, Ky L E M. Therefore, B {A4{X-,S),A4{y]F)) is a vector lattice. □ 


Proposition B2.3 With the operator norm induced by the total variation norms, 

B{M, F)) has a normal cone. 

Proof By B2.1, it only necessary to consider elements in the positive cone M~^ to calculate 
the operator norm of any L E B F))^. Hence, for any K E B -F))’*’ with 

K<L, IliFilop < llTlIop. □ 

Proposition B2.4 For any map L E B {M, Miy ] F)), |||T|||op = ||T||op. 


Proof For any such L, 


L||op = sup ||L/i|| 


(B.13) 


which, using Hahn decomposition, is equal to 
singular) 

l|F||op = sup|||L(/r-z/)|| 
Since fi and v are mutually singular, 

IIf- ^^11 = llhll +1 

Then, since |L/r| < |L|/r and \Lu\ < \L\u, 

||F||op < sup ||L|(/r + z/)(3^) 


(using /i T z/ to show they are mutually 


/i, z/ e M+ 1 

T z/, ll/i - Z/|| < 1 j 

(B.14) 

z/| = 1 /i + z/| 

(B.15) 

/i, z/ e M+ 1 

T z/, y + yy) < i j 

(B.16) 
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which can only be increased by not requiring /r and u to be mutually singular, so, rewriting 
fi + v ^ fi, ||-h||op is less than or equal to sup^gjy^+||^||<;^ |L|(/i)(3^), which, by B2.1, is equal 
to II I-hIIIop- 

On the other hand, by B2.1, 


|L|||op= sup \L\{fi){y) 


which, using |L| = L V {—L) and the form of ‘V’ in (B.IO), is equal to 


sup < 


j=i k=i 


n,m E {1, 2 ,...} 

P'1) • • • ) Pn ^ M , 

disjoint Fi,... ,Fm e X, 

ur=i = y 


which is less than or equal to 



” 

ne{l,2,...} 1 

sup < 


^1, . . . , flfi ^ ^ ') ' 

1 


<1 j 


Since, by the ylL-spact^ property of M, 
to 


(B.17) 


(B.18) 


(B.19) 


sup ||Lp|| < sup \\Lfi\\ = ||L||op 
M6M+,||m||< 1 MeM,||/.||<l 


YTj=iTj = llpill) i'his is less than or equal 

□ (B.20) 


Comment By B2.2, B2.3, and B2.4, B{M, Ai(^y; F)) is a Banach lattice. 

For the following proposition, let Q = a{£ x F) and 77 = ct(77 x X). Let the absolutely- 
continuous-complete subsets M C A1(A’;X), N C A1(3^;X'), and Q C A4(A x y;Q) also be 
vector spaces. 


Proposition B2.5 If M ® is norm-dense in Q, then for any linear maps 
L e and K e B{N,M{W]X)), the map L®K :Q ^ M(Z x >V;77)) is 

well-defined and satisfies \L® K\ = |L| ® \K\ and ||X ® 77||op = ||X||op||77||op- 

Proof Both 

L®K ={L^ -L-)® (77+ - K-) = X+ 0 77+ - X+ 0 XT" - X" 0 77+ -7 X" 0 XT" (B.21) 

Banach lattice is an AL-space if ||a -1- b|| = |ja|| + ||b|| for a, b in the positive cone. 
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and 


-L^K = -{L^-L-)®{K^-K-) = -L^^K^ + L^®K- + L-®K^-L-®K- (B.22) 

are clearly less than or eqnal to (in the partial ordering) 

\L\ O \K\ = (L+ + L-) (g) {K^ + K-) = ®K^ ®K- +L-TL-®K- (B.23) 

so \L®K\ < \L\®\K\. However, by B1.4, B1.5, B2.1, B2.4, and the dehnition of operator 
norm, |||L| g) \K\\\ < ||L||Hit'll. Hence, by A1.3, L®K is well-dehned. 

However, given any measnres u G and p G N^ and snbsets H & PL and / G X, 


|X ®K\{ux p){H X J) = snp ^ ^ |(X ® K)pj\{H x I) 

i=i 


nG{l,2,...} 
hi 1 • • • 1 Tn ^ Q 

E "- 

j=i Pj = jyx p 


(B.24) 


nsing a similar argnment to that for the proof of B2.4. By Riesz decomposition, this is 
eqnal to 


{ n m 

K){uj X pk)\ {H X I) 

j=i k=i 


n,m e {1, 2,...} 

z/i,..., G M+, pi,...,pmeN+ 
j=i = p Efc=i pk = p 


(B.25) 


which, by the triangle ineqnality, is greater than or eqnal to \{Lu) x {Kp) \ [H x I). By the 
property of signed measnres that \p x p'\ = \p\ x \p'\, this is eqnal to |Xz/|(if)|it'p|(/). 

The rectangnlar snbsets "H x X generate the a-algebra J', so |X ® KKu x p){A) 

> ((|X|i/) X {\K\p)) (H) for any A & J. By the assnmption in the proposition and A1.3, 
|X ® K\{p){A) < (|X| ® \K\) {p){A) for any p G Q^. Hence, \L ® K\ > \L\ ® \K\, so 
\L®K\ = \L\ ® \K\. 

We already have ||X ® K\\op < ||X||op||it'||op- On the other hand. 


op 


p G M+,p{A) < 1, 
1 / G N+,u{y) < 1 


,op = snp||X|(/i)(X)|A|(z/)(>V) 
<snp|(|X|®|A|)(p)(Xx W) 
so ||X® A||op= |||X|® |A|||op = |||X|||op|||A|||op = ||X||op||A||op. □ 


(B.26) 




op 


Corollary B2.6 Positive maps L G B (M, (3^; X”))^ are completely-positiv^ 

Proof Use B1.4, B2.5, and the positivity of both maps L and Ib{n)- XI 

Analogonsly to the psendo-fnnctions which compose the donbly-dnal space C{Z)** for C{Z) 
the continnous fnnctions on some compact space Z (see 113| ), we have the following: 

® A map L is completely-positive if L (g) Ib{n) is positive for every N of the form previously given. 
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Definition B2.7 A pseudo-transition “function” with data (M, Aify^iF)) has the 

properties: (i) for each measure p G M+, r^(-|-) is a /i)-valued vector J^-measure on 

y; (a) for each B E X and /i G M+, r^(i?|-) is essentially bounded with respect to /i (so it 
is in /r)); and (Hi) if fi E M+ is absolutely continuous with respect to n E M+, then 

for any B E IF, t^{B\-) = ryB)) as elements in L^(A’;/i). 

Comment Using A3.7, it is possible in many commonly encountered situations to lift a 
pseudo-transition “function” to a pseudo-transition function, but we will not further pursue 
this. The space of pseudo-transition “functions” is clearly a vector space. It is a Banach 
space under the norm 

||r.(-|-)|| = sup \\Tf,{-\-)\\ (B.27) 

M 6 M +,|| m ||<1 

Proposition B2.8 The space of maps B {M, JiA{y-, F")) is isometrically isomorphic to the 
space of pseudo-transition “functions” with data (M, M(y; F)). Furthermore, this isomor¬ 
phism takes the positive cones in each space to one another. 


Proof Given such a pseudo-transition “function” t.(-|-), define L by 

(L/i)(S) = [ r|^|(S|x) d/i(a:) (B.28) 

Jx&X 

for any B E F and fi E M. Then Ljj, is indeed a signed measure since it is countably additive 
because r is a vector measure. Since T|a^|(-|-) = T|^|(-|-) for any a G M \ {0}, L(a/i) = aLfi 
for any a G M. Also, for any u E M, 


(L(p + i,))(B) 


' x^X 


T\f,\{B\x) dpi,{x) + / T\y\{B\x)dv{x) 


ixex 


' xGX 


IxGX 


^M+W\iB\x) d{fi + n){x) 
Ti^+,,l{B\x) d{n + u){x) 


Hence, L is linear. To see that it is bounded, we have 

||L||op = sup ||T/i|| = sup \Lfi\{y) 

By Hahn decomposition |100|, this is equal to 


sup |(L/i)(A) - {Lp){B) 


h e M, ||/i|| <1, 1 

disjoint A,BeF J 


(B.29) 

(B.30) 

(B.31) 

(B.32) 

(B.33) 
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which is equal to 


sup 


j 


h e M, ||/i|| < 1 , 

hnite, disjoint collections C IF 


(B.34) 


This is bounded above by 



he M+, ll/iil < 1 , 

hnite, disjoint collections {Aj} C IF 


= sup 

M 6 M+,||m ||<1 


l|r.(-|-)ll 

(B.35) 


Finally, if is positive, L is clearly positive. 

Now suppose we are given such a map L. For any measure /i G M"*", let be the subspace 
that is absolutely continuous with respect to fj,. By the Radon-Nikodym theorem, is 
isometrically isomorphic to Therefore, the adjoint map takes Aiiy^IF)* 

to jj)* = yu), which is a subspace of L^{X] n) since fi is hnite. For each B & IF, 

dehne t^{B\-) to be where $5 G AAiy-, IF)* is the linear functional that evaluates 

a signed measure on the set B. Then Tyi(-|-) is a /i)-valued vector measure since the 

countable additivity of /i implies Tyi(-|-) is countably additive. If jj, is absolutely continuous 
with respect to z/ G M+, then C so t^{B\-) = ri,(R|-) as elements of L^(T’;/i). 
Therefore, t.(-|-) is a pseudo-transition “function”. To see that t.(-|-) is bounded in norm, 
we have 


|r.(-|-)|| = sup ||ry,(-|-) 


(B.36) 


= sup 


< sup 


heM+,||yu|| < 1 , 

hnite, disjoint collections {Aj},C IF 


T e M+, ||/i|| < 1 , 

hnite, disjoint collections {Aj}, C IF 


However, 

^^/i(Aj) ||'r/i(Aj|-)llL°o(A’;^) “ ^ I j Hop < l|-^l|op 

3 3 \ 3 / 

(B.37) 

so ||'r-(-|-)ll ^ ll-^llop- Lastly, if L is positive, so must be t.(-|-); otherwise, if there were some 
jj, G and B E IF such that r^(i?|-) were strictly less than zero on a set A G T with 
/i(A) > 0, then 


{L{lA^F)){B) = f {L\M^)*^BdiJ, = f Tf,{A\x) dfx < 0 

Ja Ja 


(B.38) 


which would be a contradiction. □ 
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It is also possible to define r.(-|-) in terms of L rather than adjoints of restrictions of L using 
the Radon-Nikodym derivative: 






d (((-^|m^)*^b) h) 

dfj, 


dfiB 

d/i 


(B.39) 


where /is is the £^-measure on X given by ^b{A) = (L(lyip)) (B). Using this, we have the 
following: 


Proposition B2.9 The positive map L ^ B {M,Ai{y; X))~^ is norm-preserving on the pos¬ 
itive cone if and only if the associated pseudo-transition ’’function” r.(-|-) satishes = 

[1;^] for every measure fi G M+. 


Proof Suppose such a positive map L is norm-preserving on the positive cone; then, for 
any measure p G and subset A E £, 

(L(U/i)) (3;) = = t{A) (B.40) 

Therefore, using the notation of the preceding comment, /ly = p, so = ^ = [la’]- 

Conversely, suppose r.(-|-) is such that r^(3^|-) = [l^’j for every measure /i G M+. Then, for 
any measure fi G M+, 

(L/i)(3^)= [ T{y\x)d^i{x)=fi{X) (B.41) 

Jx&X 

SO L is norm-preserving on the positive cone. □ 


B.3 L^-spaces 

Notation In the following, let X, 3^,... denote sets and /i, i/,... denote a-hnite measures. 
Hilbert spaces, denoted H, J,... are complete, sesquilinear inner-product spaces, with no 
restriction as to their dimension or separability. 'P(H), T’(J),... denote the spaces of density 
matrices (trace-class, self-adjoint operators) on the specihed Hilbert space. These spaces 
are Banach spaces employing the trace norm. L^(A’;p), L^(3^; i/),... denote the space of 
integrable, real-valued “functions” on the given sets with respect to the given measures. 
These spaces are Banach spaces employing the LUnorm. L^{X] /i;P(H)), T^(3^; i/;'D(J)),... 
denote the space of Bochner-integrable, density-matrix-valued “functions” on the given sets 
with respect the given measures. These spaces are Banach spaces employing hrst the trace 
norm pointwise, then the L^-norm. For n G {1, 2,...}, AA.n is the space of n x n-matrices. 


Comment The following proposition strengthens the well-known result, which is a special 
case of a result by Grothendieck 17 , that L^{X x y-,T X u) = L^{X] /i)®L^(3^; z/), where ® 
indicates completion in the projective norm|^ 


^The projective norm on A (X) B is the norm induced by duality with Bilinear(A, B), 


C A = 


infE, 


|bjH, where the infinum is taken over all Ei 


bj G A ( 


B that equal c. 
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Proposition B3.1 The finite-nonnegative-tensor-ranl“functions” in xy-, 
with respect to “functions” in fi)~^ and n)+, are dense in the norm topology. 


Proof Take any f & xy-, fi x u)~^. f can be arbitrarily well-approximated in x 

y] jjiX n)-norm by simple functions: 'Yl,j ^j^Aj for finite collections of positive reals {aj} and 
finite /i X //-measure subsets {Aj}. By the construction of product measures (see (ig). 
each Aj is covered by some finite collection of disjoint, measurable, rectangular subsets 
[B, X Ck] with fixu ((IJ^ Bk X Ck) \ Aj) arbitrarily small. Hence, / can be arbitrarily well- 
approximated in xy; fix z/)-norm by simple functions: ^ • a,TB,xC, = ® Ic, 

for finite collections of positive reals {%}, finite p-measure subsets and finite //-measure 
subsets {Cj}. □ 


B.4 Density-matrix-valued L^-spaces 

Proposition B4.1 L^(A’;/i; T>(H)) ® //; T>(J)) is trace-norm dense within x 

y-fixp-V{H®S)). 

Proof Given operator p G 'D(H ® J) , by the spectral theorem for compact operators, p 
can be arbitrarily well approximated in trace-norm by sums of the form G) e* for 

some {ajjj'^i C M and orthonormal collection of vectors C H 0 J. By definition, 

each Sj can be arbitrarily well approximated in H 0 J-norm (so 0 e* will be arbitrarily 
well-approximated in trace-norm) by sums of the form G) gk for some bk G C, 

ffc G H, and gfc G J. Using polarization, 

bkh ® gfcj G> f ^ fofr G) gej (B.42) 


Hi -| 

= bkb£- ((fo -1- fg) 0 (ffc -1- f^)* — (ffc — f^) 0 (ffc — ig)* 
k,i=l 

+*(ffc lig) 0 (ffc zig)* - *(ffc - ilg) 0 (ffc - ilg)*) 

((gfc + g^) ® (gfc + g^)* - (gfc - g^) ® (g/c - Si)* 
+t{gk + ^gi) 0 (gfc + ^ge)* - t{gk - ^ge) ® (gfc - ^gi)*) 

m 

=yy (fo ® ffc) (gfc ® gD 

k=l 

+ ^ yy (((ffc + ie) G (fo + ie)* - (fo - h) ® (fo - m 

^ k<i 


^°Using only positive real scalars. 
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® ((gfc + Si) ® (gfc + gi)* - (gfc - gi) ® (gfc - g^)*) 

+ ((ffc + tie) ® (ffc + ^ii)* - (ffc - 't-ii) ® (ffc - ^UT) 

(g) ((gfc + ig,e) ® (gfc + tge)* - (gfc - igt) ® (gfc - *g£)*)) 

-%hhe) (((ffc + f,) 0 (ffc + f,)* - (ffc - f,) 0 (ffc - f,)*) 

0 ((gfc + tge) 0 (gfc + tge)* - (gfc - tge) 0 (gfc - ^g^)*) 

+ ((ffc + tie) ® (ffc + ^ii)* - (ffc - 'lii) ® (ffc - ^f^)*) 

<8) ((gfc + g^) 0 (gfc + ge)* - (gfc - g^) g> (gfc - g^)*))) 

Then, using the dehnition of Bochner integrable functions and following the argument in the 
proof of B3.1 gives the desired result. □ 


Corollary B4.2 The Bochner integrable “functions”!^ L^(A’; p; 5i(H)) 0 L^(3^; z/; 5i(J)) 
are trace-norm dense within L^{X x y-.T X z/; 5i(H0j)). 


Proof Using Cartesian decompositiori, for any Hilbert space K, i5i(K) = 'P(K) +tD{K). 
Then using the preceding proposition gives the desired result. □ 


Proposition B4.3 If the cx-hnite measure fi is inhnitely-divisible, there is no non-zero, 
diagonal^ operator in V{L‘^{X] /i)). 

Proof Take p G 'D{L‘^{X; p)) to be diagonal. Take any “function” / G L‘^{X]p). Take 
any e > 0. Since p is inhnitely-divisible, the measure \ f\^p is inhnitely-divisible by B1.9, so 
there is a countable partition {Bj} of X such that \f\^p{Bj) = |/p dp < e for every j. 

Then, since p is diagonal, 

(p/./> = X^pIbJAbJ) = (B.43) 

3,k j 

Using the spectral theorem for compact operators to write p = fo^ i^j) 

absolutely-summable sequence, orthonormal {V'j} C X]p), and tfj* the linear functional 
{-Ttlij), this becomes 

= (B-44) 

j k 

^^The space are the trace-class operators. 
i^Writing p = i(p p*) -h z • - p*) for p G 5i(K). 

operator p G I?(L^(A;p)) is diagonal if (p1b,1b/) = 0 for every disjoint, p-measurable B,B' with 
finite p-measure. 




APPENDIX B. PROPOSITIONS FOR OPTION I 


124 


Since the expression | (1^^,/, is bounded by the Cauchy-Schwartz inequality and (Xj) is 
absolutely-summable, the sum in k is uniformly convergent, so (p/, /) is equal to 

(B-45) 

k j 

The “functions” {1^^./} are orthogonal, so 

{pfj) < -supIIIb./II^ = ||p||trace-£ (B.46) 

Since £ was arbitrary, it must be that (p/, /) = 0. Since / was arbitrary, it must be that p 
is the zero operator. □ 


B.5 Maps on density-matrix-valued L^-spaces 

Proposition B5.1 Any map L G p; ©(H)), z/; T>(J)) satisfies 


||L||op = sup < 

iiipii 

pen(A';p;I>(H)),||p||<l, 1 

p is pointwise almost-every- ) 


1 

where rank one J 


Proof By the dehnition of Bochner integrable functions and by the spectral theorem for 
compact operators, simple functions with values in the finite-rank, self-adjoint operators are 
p; ©(Hjj-norm dense. Therefore, ||T||op is equal to 




sup < 


n m 




n,m E {1, 2 ,...}, disjoint 
p-measurable subsets {Ai,..., An}, 
collections of orthonormal 
elements of H 

{All, • • • ) Xnm} T M, 

E;=iEr=iiAi.iM^i)<i 


(B.47) 


where Bj* is the functional Now £x a value of j, the Ai,...,A„, all the collec¬ 

tions of orthonormal vectors, and all the A’s except Aji,... ,Xjm- The set of all elements 
of of L^{y-, zz;T>(J) given by Er=i Ajfc-h ® Bj*) for fixed |Ajfc| is a finite¬ 

dimensional, convex subset; hence, its maximum value o£ norm necessarily occurs at its 
extreme points where one Xjk is nonzero whereas Xji ,..., Xjk, ■ ■ ■, Xjm are all zero. Since the 
choice of j was arbitrary, this is true for all j, so the supremum is unchanged by restricting 
to m = 1. □ 
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Proposition B5.2 Any positive map L G /i; ©(H)), z/; D(J))’'' satisfies 


||L||op = sup < 

iiipii 

peL\X-p-V{H)y,\\p\\<l, 1 
p is pointwise almost-every- ) 



where rank one J 


sup < 


^ X,L ® e*) 

i=i 


Proof By the proof of the preceding proposition, ||-h||op is equal to 

n G {1,2,...}, disjoint 
/i-measurable subsets{Ai,..., An}, 
unit-norm el,..., G H, (B.48) 

(Ai,..., A„} C M, 

E;=iiAiiMAii)<i 

The supremum can only be reduced or stay the same by restricting to positive Aj’s. However, 
since L is positive, by the triangle inequality and the quasi- AL-propert}f^ of z/;T>(J)), 


(U.e,- ® e}) 
i=i 




^|A,|L(U^.e,®e}) 

i=i 


(B.49) 


so the supremum can also only be increased or stay the same by restricting to positive A^-’s. 
Therefore, it must have the same value. □ 


Corollary B5.3 The cone of positive maps B (T^(A; /i;T>(H)), L^{y-, i>-,T>{I)))'^ is a normal 
cone for the induced operator norm. 


Proposition B5.4 If L G H (L^(A’; p; P(H)), L^(3^; z/; ©(J))) is completely boundecf^ 
then, for any space Z, any measure r, and any Hilbert space K, \\L ® /||op < ||T||inatrix 
with I the identity map in B {L^{Z;t;'D{K))). 


Proof By the dehnition of operator norm and the dehnition of the tensor product of maps, 
\\L ® /||op is equal to 


sup 


||(L®/)p|| 


hnite-tensor-rank p E x Z] p x t] 'D(H ® K)) 

with IIpII < 1 


(B.50) 


By the proof of B5.1, it is only necessary to take the supremum over simple functions taking 
values with rank one. By the argument in the proof of B3.1, the sets in the simple functions 
can be restricted to being rectangular. This eliminates consideration of Z and r, replacing 
them with positive, real scalars that can be incorporated into the operators. Finally, rank- 
one, tensor-rank-n operators live on a n-dimensional subspace of K, which can be identihed 
with C"". Making the identihcation L^(one point; trivial measure;'D(C")) -H- ©(C") then 
gives that ||L 0 /||op is equal to sup„<dij^K \\LZ iMnWop- □ 

Banach space has the quasi-AL-property if ||a -|- b|| = ||a|| + ||b|| for positive a, b. 

is completely bounded if it has finite matrix-norm, ||A|jmatrix = sup„ \\L 0 /M„llop- 
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Proposition B5.5 If the positive map L G 13 /x; P(H)), i/;'D(J)))’'" is such 

that, for some space Z, some measure r, and some Hilbert space K, L ® / is positive with I 
the identity map in B r; ©(K))), then ||L (g) /||op = ||-h||op. 


Proof Starting as with the preceding proof, we arrive at the point where ||L(g)/||op is given 
as 




sup < 


(g) I){aj\j (g) V* (g) IsJ 
i=i 


hnite-tensor-rank 
{vi,..., v„} e H (g K, 

{ai,... ,a„} C M, > 

/i-measurable {Hi,..., H„}, 
r-measurable {Bi, ..., Bn}, 
withEi=i \aj\\\^j\\^KAj)r{Bj) < 1 ^ 


(B.51) 


where v* is the functional (^Vj). Since L g / is positive, as in the proof for B5.2, it is 
possible to restrict to positive a^-’s without changing the result. Then aj can be combined 
with T{Bj) and both incorporated into a change in the norm of Vj. Furthermore, since Vj is 
of hnite-tensor-rank, it is necessarily in H g for some hnite-dimensional subspace C K, 
so it can be written as x-^ g ei with {eU an orthonormal basis for Lj. 

Then we have for ||L g /||op, 


n e (1, 2,...}, m G (1, 2,...}, m < dim K, 
p-measurable (Hi,..., An}, I (B.52) 

By taking the x’s to be unit length and introducing new, real scalar variables for their 
squared norm, then by following the argument in the proof of B5.1, the supremum is the 
same if m = 1. Hence, \\L g /||op is equal to 


EE 

j=l k=l 


\H^l 




n,)| 


sup < 

Wtr |[L(xj0x*1aj) 1 

nG{l,2,...}, ^ 

/i-measurable (Hi,..., An}, ) 



{xj} C W^jf fi{Aj) < 1 J 


which is ||T||op. □ 


(B.53) 


Corollary B5.6 The completely-positiv^^ maps 




are completely bounded. 


is completely positive if L 0 /a 4 „ is positive for every n G (1, 2,..., }. 
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Corollary B5.7 The cone of completely-positive maps, 

is a normal cone for either the induced operator norm or the matrix norm. 

Proposition B5.8 If L G B L^{y;iy;V{I))) is completely positive, then, 

for any space Z, any measure r, and any Hilbert space K, L ® / is positive, with I the 
identity map in B {L^{Z; r; V{K))). 

Proof Since L is completely positive, by B5.6 it is completely bounded. Hence, L Z I 
exists by B5.4. Furthermore, by B4.1 and A1.3, L 0 / is unique, so it is meaningful to 
speak of it being positive. 

Now suppose there were some space Z, some measure r, and some Hilbert space K such 
that L<Zl were not positive. Then there would be some positive p E {A x Z] jj, x t] T>(H ® 
K))+ such that {L Z I)p is not positive. Since the cone L^{y x Z]i> x r;T’(J ® K))'*' is 
norm-closed and L Z I is continuous, that implies there is a relatively open neighborhood 
of p in the cone L?-{X x Z] p, x t] D{V\ Z K))+ whose image under L Z I does not intersect 
L\y xZ;iyx r;T>(J 0 K)) + . 

Now approximating p as in the proof of B5.4, one hnds that for this to occur there must 
be some n G {1,2,...} for which L Z Ijvin is not positive; however, that is a contradiction. 
□ 


Proposition B5.9 If either dimH or dimJ is finite and if a positive map 

L e B {L'(X;KT>m,Lyy;’';V{J))y 

is such that L Z Im^ is positive for m = min{dim H, dim J}, then L is completely positive. 

Proof Clearly, since L Z iMm is positive, so is L 0 /_a 4 „ for all n < m. Now take n > m. 
LzImu 'will be positive if for every p G p]V{HzPT))^, y G J(8 )C”, and //-measurable 

Bey, 

[ {{{LZ lMjp)y,y)jxc^du >0 (B.54) 

J B 

By the dehnition of Bochner integrable functions and the spectral theorem for compact 
operators, it is enough to show this for p that are simple functions with value in the rank- 
one operators. 'We then have to consider, for any hnite collection of vectors {xj} C H 0 C” 
and yU-measurable subsets {Aj} 

V / {{Lz iMrMNj®^*j^A,)y,y)s®c'-dv 


(B.55) 
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where x* is the functional {•jXj). Writing y = ® ~ J2k=i^jk ® 

{e^} an orthonormal basis for C" gives (B.55) as 


n „ 

L {L{wjk 0 w*; Vfc)j du 


(B.56) 


However, ® rank at most m, so there are {wj^} and {v/} such that 

n m 

^ w*; 0 Vi = ^ w*; ® Vi (B.57) 

1=1 1=1 

The condition for L 0 Ij^^ to be positive is then that for every //-measurable B G y, hnite 
collection of p-measurable subsets {Aj}, and hnite collections of vectors {wj^} C H and 
{vfc} C J, 

m p 

TY. I {L{wjk 0 w*i 1a^ ) Vi, Vfc)j diy>0 (B.58) 

j k,i=i-^^ 

However, this condition is independent of n, as long as it is greater than or equal to m. □ 


Proposition B5.10 The space of completely-bounded maps, 

is a Banach space with respect to the matrix norm. 

Proof Let (Lj) be a Cauchy sequence in the matrix norm of such maps. Since the matrix- 
norm is greater than or equal to the operator norm, this is a Cauchy sequence in operator 
norm, so since the space is a Banach space with respect to the operator norm, it converges 
to some Loo in that norm. It remains to show that Loo is completely bounded. For each 
u G {1, 2,...}, by the triangle inequality, \\K 0 Im„\\op < n^ll-^llop for any linear map K G 
B {L^{A-, jj,] T>(y\)), L^iy] //; L>(J))); hence, the functional iF —>■ ||iF 0 ImA\ov is continuous in 
the operator-norm topology. Since {Lj) converges to Loo in this topology, it must be that 

||Loo 0 Cvij^llop lim ||Lj 0 Cvi„||op Y lim ||Lj||matrix (B.59) 

j^OO 

The right-hand limit necessarily exists since (Lj) is a Cauchy sequence. Therefore, ||Loo Umatrix < 
limj_s.oo ||LjUmatrix, SO Loo is Completely bounded. □ 

Proposition B5.11 The subset of B (L^(T’; p; ©(H)), L^(3^; z/; L>(J))) for which the tensor 
product with Ij^^ is positive for some hxed n G {1, 2,...} is closed in the weak topology. 
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Proof We will show the complement is open. Take such a map L that is not in the subset. 
By the argument in the proof for B5.9, that implies there are some //-measurable B G y, 
hnite collection of /i-measurable subsets {Aj}, hnite collections of vectors {wj^} C H and 
{vfc} C J, and e > 0 such that 


m n p 

EE/ {L{wjk^w*i IaJvz, Vfc)j(iz/< -e 

1=1 k,i=i 


J = i k,l=l 

Then, by the triangle inequality, all the maps in the weak neighborhood 


nn-vb ; Ia,Ni ® V;* Is; 


e 


j=l k,l=l 


2n^m 


(B.60) 


(B.61) 


/finn^h ; {Wjr ® W*i -h Wji 0 ; Vfc 0 Ib] 

j=lk=lr<l 
m n 

+nnn^(^ ; Wjfc (g) ; (vfc 0 v;; + 0 wl) Is; 

j=l k=l r<l 


n^m 


n^m 


+n n n ^ ® ® ® ® 

j=l k<l q<r ' 

will also fail to yield a positive tensor product with Imu- 


2e 

n'^m 


Corollary B5.12 The cone of completely positive maps is weakly closed in 


Comment The preceding result also follows from showing that the spaces are norm-closed, 
then using the separating theorem to argue that the weak and norm topologies have the 
same closed, convex subsets. The approach followed here is preferable since, as has already 
been noted, the separating theorem depends on the axiom of choice through the Hahn- 
Banach theorem. For the case where the measures are trivial, the space is a dual space, 
T’(H) = /C(H)*, so it is possible to do better. 

Proposition B5.13 The subset of B (©(H), T>(J)) for which the tensor product with Im„ 
is positive for some hxed n E {1, 2,...} is closed in the weak* topology. 


Proof Following the proof of B5.9, if L (g) Ij^^ is not positive, then there are some hnite 
collections of vectors {w^} C H and {v^} C J, and e > 0 such that all the maps in the weak* 
neighborhood 


Pi Af WfcWfc IaF V/ O v; 1^; 

k,l=l 


e 


(B.62) 



APPENDIX B. PROPOSITIONS FOR OPTION I 


130 


; (w^ (g) W;* + W; ® w*) 1a,.; Vfc (g) 1^; ^ j 

k=lr<l 


n 

+nn^( L; Wfc ® Wfc 1 a,. ; (Vfc g) v;;, + Vm (g) Vfc) Is; ^ j 
k=lr<l 



will also fail to yield a positive tensor product with IjUn- 

Corollary B5.14 The cone of completely positive maps in B (T>(H),D(J)) is closed in the 
weak* topology. 

Proposition B5.15 Any bounded, positively-linear map L to z/; T>(J)) that is given 

on the positive cone of L^{X'i /i;'P(H)) extends uniquely by linearity to a map 

Proof Let L be any such map. Extend L to /r; T>(H)), z/; T’(J)))+ by 

L(p) = L for any p e L^(A’;/i;T>(H)) (B.63) 

where | ■ | is applied pointwise with |p|(a;) = |p(a;)| = a/ p{xy. It is readily seen that positive 
linearity implies the extended L is now linear over M. Furthermore, by B5.2, this extension 
does not increase the operator norm. 

Proposition B5.16 Any map L G (L^(A; p,; T>(H)), L^(3^; z/;'D(J)))’'’ extends uniquely 
by linearity to a map L G (L^(A; p; iSi(H)), L^(3^; z/; i5i(J)))’''. This extension has operator 
norm less than twice that of the restricted map. 

Proof Let L be any such map. Extend T to L G p; iSi(H)), L^(3^; z/; iSi(J))’'') by 

using Cartesian decomposition, 

L{,p) ^ for any p G L^(A;p;5i(H)) (B.64) 

where * is applied pointwise with p*{x) = p{x)*. It is readily seen that real linearity implies 
the extended L is now linear over C. By dehnition, this extended map is a positive one. 
The triangle inequality implies the extension has operator norm less than twice that of the 
restricted map. □ 
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Comment Using 


{L\X; fi)) C ^2 {L\X-, /i)) = {A X A; X fi) (B.65) 


(where S 2 are the Hilbert-Schmidt operators) one might imagine a map diag that extracts 
the diagonal of an operator p E Si [A; p,)) by diag p{x) = p{x,x). This does not work 
since the actual values of p are completely arbitrary on the diagonal. However, there is a 
useful map which does correspond to this naive notion. By singular-value decomposition, 
any p E Si {A;p)) can be written as the countable sum ® 9j i'^j) ^ 

and some unit length {fj,gj} C L‘^{A;p). There is a measure u associated to p, clearly 
absolutely continuous with respect to p, given by for any p-measurable subset B C A, 


u{B) = tr PbP 



where Pb is the orthogonal projector associated to the subspace of L‘^{A-,p) that is zero 
(almost everywhere with respect to p) outside B. Then we have: 


Definition B5.17 

Nikodym derivative 


Let the map diag: iSi (L^ {A] p)) ^ {A] p) be dehned by the Radon- 

,. dv 

diag P= 

dp 


The diagonal extraction map is clearly linear and positive. We also have the following 
properties: 

Proposition B5.18 The diagonal extraction map has operator norm one. Furthermore, 
on the equivalence classes iSi (L^ {A; p)) /ker diag, it is an isometric bijection. 


Proof Using the singular-value-decomposition form for the trace-class operator p as above. 


lldiag p\\ = 


IX 




dp 


dp 

Jx 


(B.66) 


using the triangle inequality and the Dominated Convergence theorem. Using the Cauchy- 
Schwartz inequality, this is less than or equal to 




trace 


J 


(B.67) 
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Since p was arbitrary, ||diag||op <1. It is equal to one since diag is clearly an isometry on 
the positive cone V p)) . 

If the (j-algebra for X were generated by countably many atomic subsets, it is easy 
to see that diag is an isometric bijection-one simply notes the diagonal operator whose 
diagonal agrees with the diagonal of p is in the same equivalence class and that for this 
particular operator, the map clearly preserves the norm. However, in general, there are 
no diagonal, trace-class operators on so we need to work a little harder. Let 

s G L°° [X] (diag p)p) be the phase valued “function” given by 


diag p 
I diag p\ 


(B.68) 


(it can be left undehned where diag p is zero). Then the trace-class operator ^ = Sv^diag p® 
■\/diag p has the same trace as p; hence, it is in the same equivalence class. Therefore, 


= inf 

/3e[p] 


trace < lien trace lldiag pII 


which by the above result is less than or equal to 


(B.69) 


inf llplltrace (B.70) 

/36[p] 


Hence, ||[p]|| = ||diag p||. Since p was arbitrary, diag is an isometry. It is surjective since for 
any “function” f & [X] p), dehning s similarly to above. 


s = 


f 

I/I 


(B.71) 


(it can be left undehned where / is zero), then the trace-class operator ^ = s^/J ® yjf has 
diag e = /• □ 


Proposition B5.19 Both the diagonal extraction map diag and its inverse diag ^ [X] p) 

5i (L^ [X] p)) /ker diag are completely-positive. 


Proof By the spectral theorem for compact operators, it is enough to demonstrate posi¬ 
tivity for diag ® /x„ acting on rank-one density matrices. Furthermore, since hnite-tensor 
rank vectors are dense, it is enough show that 

( m m \ \ 

® fk ® a* J b, b \ dp>0 (B.72) 

j=i k=i J / 

for any m,n ^ Z+, any {aj} C C", any b G any {fj} C (df; p), and any p-measurable 
subset E C X. However, the left-hand side is equal to |pp dp, where g = fj^ 

which is clearly greater than or equal to zero. 
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Firstly, note diag“^ ® Im„ is well-defined since 
Si [L^ C") /ker (diag ® Imu) = (‘^i {L^ [X] /i)) /ker diag)) 0 Mn (B.73) 

Furthermore, by A1.3, diag“^ ® -^Li(y;i^;‘Si(H)) is well-defined for any 3^, i/, H since 

(5i (L^(A;/i)) /ker diag)) ® L^{y-,iy;Si{H)) (B.74) 

is norm-dense in 

Si (L^ {X X y-, fi X u) ® H) /ker (diag 0 lLHy-,u-,Sim) (B.75) 

(This will be needed in further propositions below.) A notion of positivity is provided 
on (5i (A;/!)) /ker diag)) 0 by taking equivalence classes with positive members 

to be positive; this is equivalent to a being positive if tr (3a > 0 for any positive (3 G 
yV{X;n; C”)+ = diag /i)+ 0 T>(C"')''‘ (W is dehned below in B5.21, diag * is dehned 

following this proof). Any such positive (3 can be written as 7*7 for some 7 G W(A; /r; C”) = 
diag *L~(A;/i)+ 0i3(C”). 

Using the preceding, together with the dehnition of Bochner integrability for L^{X] fi] D{C"')) 
and the (hnite dimensional) spectral theorem, to show diag”^ is completely positive we need 
to show 

tr ( 7 * 7 (diag“^ 0 /;n„)lijaa*) > 0 (B.76) 

for any /i-measurable subset i? C A, a G C"", and 7 G diag *L°°{X; /x)+ 0 f3(C"'). Writing 
7 as orthonormal basis for C" and fij G L°°(A; /i), the left-hand 

side is 

I 9j9j (B.77) 

J B 

where au = (a,e^) and gj = J22=i^kfjk- This is clearly positive. □ 

Comment Using B5.18 with standard results for Banach spaces then gives the following 
bijection: 

diag* : L°° (A; /x) —)■ image diag* C B (L^ (A; /i)) (B.78) 

Note diag*(/) 5 f is the pointwise multiplication fg for / G L°° (A; /i) and g G {X; iT). Then 
diag* is clearly isometric and is a representation (clearly weak* continuous) of the von Neu¬ 
mann algebrgp^ L°° (A; /i) (with product given by pointwise multiplication of “functions”) 
as operators in B (L^ (A; /x)). 

Morally, one would hope for (A;/r; 5i(H))* to be (A;/r; i3(H)), which is a (7*- 
algebra. That is almost correct, as the following proposition indicates. 

Banach algebra is a Banach space equipped with an associative, distributive product satisfying |ja6|| < 
||a||||6||. A *-algebra is a Banach algebra with an antilinear involution. A G*-algebra is a *-algebra where 
||a*a|| = ||a|p. A von Neumann algebra (also termed a W*-algebra) is a C'*-algebra which, as a Banach 
space, is a dual space. 


^ ^ I fjifjk dp. ^ ^ 

i,j,k = l dS 
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Lemma B5.20 For any 4/ G i5i(H))*, 


ll'FlIop = sup 


1 




yu-measurable B C X 
unit length G H 


Proof By the dehnition of the Bochner integrability for (df;/r; iSi(H)), it is enough to 
take the supremum over simple, iSi(H)-valued functions: 

u-measurable, disioint \BA 

each with unit trace-norm 
{t,} C M+ U {0} 

For hxed {Bj} and {(pj}, the expression the supremum is being taken over is rational linear 
in {tj}] hence it is necessarily maximized when all but one value of tj are zero. Therefore, 
it is only necessary to take the supremum over iSi(H)-valued functions of the form 
for yU-measurable B G X and unit-trace-norm p G 5i(H). By a similar argument applied to 
the singular-value decomposition of p, it is further only necessary to take p rank-one. □ 




Proposition B5.21 There is a well-dehned, completely-positive, surjective map with op¬ 
erator norm one 


diag ® /h(h) : {L^X; /i) ® H) ^ (df; /r; 5i(H)) 

which is an isometry on the positive cone. This map gives rise to the isometric bijections 

diag(8)/6(H) ; Si {L^{X]p) 0 H) /ker (diag O/^(h)) {X; jj,; Si{H)) 

(diag® /^(H))* : (df;/r; 5i(H))* image (diag ® /^(h))* = >V(df;yu; H) C (^;h) ® H) 

Proof Using B4.2, Si (L^ {X; jT}) ® iSi (H) is trace-norm dense in iSi (L^ (df; /i) ® H). By 
the dehnition of Bochner integrability, (X; fi] iSi(H)) is the norm completion of {X\p)® 
i5i(H). Hence, diag ® Ib(v\) is a well-dehned map for the given domain and range by A1.3. 

It has operator norm one by A1.3, B5.19, B5.18, and B5.5. It is completely positive by 
B5.18 and B5.8. By the dehnition of Bochner integrability, the simple functions of the 
form for disjoint, yU-measurable subsets {Bj} and {pj} C >Si(H) are norm-dense 

in {X; /r; iSi(H)). These have preimages in (L^ [X] ji) ® H) given by Ylj (isj ® Isj) Tj- 
Since diag®/g(H) is bounded (hence, continuous), it is surjective. To see that it is isometric 
on the positive cone V (L^ [X] p) ® H)’'", by the spectral theorem for compact operators, it is 
only necessary to check the rank-one case. By continuity and the dehnition of (df; yu) ® H, 
it is only necessary to check the case where the vectors are of hnite-tensor-rank, fj 
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for {fj} C and C H. Then, 

mm mm m « 

® ^3 ® '^f*k®Ck=Yl / f3Tk ^k) 

J=1 k=l j=l k=l j,k=l ^ 

( m m \ 

Y1 ^3 ® ij fk ® Ck ) (B.80) 

j=i k=i / 

so diag ® Ib{v\) is indeed isometric on the positive cone. 

By the preceding results, the map 

diag 0/^(H) : *Si (L^ (df;/i) 0 H)/her (diag 0 /^(h))( df;/r; 5i(H)) (B.81) 

is certainly a bijection. From the dense set of values considered above to establish surjectivity, 
it is isometric. By standard properties of Banach spaces, the adjoint map 

(diag(g)/H(H))* : (df;/r; 5i(H))* ^ image (diag(g)/H(H))* = W(df;/i; H) C B (df;/i) O H) 

(B.82) 

is a bijection. To see it is an isometry, take any T G (df;/i;iSi(H))*. By the dehnition of 
operator norm and the Cauchy-Schwartz inequality, there is the well-known formula for the 
operator norm in i3(J) for any Hilbert space J, 

||/3||op = sup (/5x,y) (B.83) 

x,y € J 

l|x|| < i,||y|| < 1 

Using this, we have 

IKdiag®/B(H))*d'||op = sup ((diag(g)/6(H))*d'C,r?) 

C,,rj £ D 

IICII<i.lhll<i 

= sup T ((diag( 8 )/H(H))(Ch*)) (B.84) 

IICII <bhll <1 

By the preceding lemma B5.20, this is certainly i|tt'i|op, since we only have to consider C, 
and 7] of the form 1b® and Is ® respectively, for /i-measurable B G A and unit-length 
G H, to get lld'llop. □ 

Proposition B5.22 The Banach space W{X;jj,;H) C B {L'^ {X; jT) ® H) dehned in the 
preceding proposition is a von Neumann algebra. 
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Proof By standard properties of Banach spaces, W(A’; /i; H) is precisely composed of those 
elements that annihilate the kernel of diag®/6(H); in other words, those 0 G B jj) ® H) 

snch that tr ipa = 0 for any a in the kernel. Now take any / G p) and consider the 

operator 7 = diag* f ® In- By nsing singnlar-valne-decomposition (as in the dehnition of the 
diag map in B5.17) together with approximating each resnlting vector by a hnite-tensor-rank 
approximation and taking the limit as the approximations become better, it can be shown 
that, operating on the left or the right, the kernel is invariant nnder the action of 7 . Take any 
/d G (L^(A’;p) ® H). By a similar argnment, it can be shown that diag®/g(H)(7/5— /dy) = 0; 
hence. 


0 = tr (p(7/3 - /^y) = tr /^((py - y(p) 


(B.85) 


nsing the linear and cyclic properties of the trace. Since /d was arbitrary, it mnst be that (p 
commntes with y; since / in the dehnition of y was arbitrary, ip mnst be in the commntant 
of {diagU~(T’;/i)®/H}. 

Now snppose ip is in the commntant of diag*L°°(A’; p)®/h. Now we will consider a certain 
norm-dense snbset of (L^ (df; /i) ® H): hrstly note iSi (df; p)) ® iSi (H) is norm-dense. 
Using (df; /i)) C ^2 (df; /i)) = L‘^{A x A; fxx jT), we can consider iSi (df; p)) as a 

space of “fnnctions” on A x A. Using Cartesian decomposition and the spectral theorem for 
compact operators (or singnlar valne decomposition), trnncating the spectral decomposition, 
then approximating each of the vectors in L‘^{A] p)-noTm by simple fnnctions, and then 
pntting everything back together with the proper intersections and relative complements of 
snbsets of A, the overall effect is to get something in the following snbset: 


A = 


iSi (H) -valned, simple fnnctions with rectangnlar 
snbsets Ax B for A, B in the finite, disjoint 
collection {Ci,..., Cm} of /i-measnrable snbsets of A 


(B.86) 


Therefore, the preceding snbset is norm-dense. 

Now let ns consider the snbset Aq <Z A which is composed of those iSi (H)-valned, simple 
fnnctions that are zero on all the rectangnlar snbsets that inclnde the diagonal; in other 
words, those that assign zero to each snbset of the form A x A. Then ^0 is norm-dense 
within the kernel of diag 0 Ib(h)- To see this, take any a in the kernel and let (oj) be a 
seqnence of elements of A converging in norm to it. Each Oj can be written as bj + Cj, 
where bj G ^0 and cj is zero everywhere except for the snbsets of the form A x A. It 
is easy to see that ||cj||trace = ||diag ® Iis[H)Cj\\. Since a is in the kernel, it mnst be that 
hmj_,.oo diag (g)/B(H)Cj = 0; hence, limj_,.oo ||cjHtrace = 0, so {bj) converges in norm to a. Since 
a was arbitrary, ^0 is indeed norm-dense in the kernel. 

Now again take any a in the kernel and let (bj) be a seqnence of elements of ^0 converging 
in norm to it. For each j, by the definition of A, there is some associated finite, disjoint 
collection {C (,..., of /i-measurable subsets of A] take fj to be any function that takes 

^®The commutant of a subset of an algebra is composed of all elements of the algebra that commute with 
the subset. 
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distinct values on each subset: say fj = ^ instance. Then bj can be written as 

(diag* fj ® /b(h)) dj - dj (diag* fj ® Ib{h)) (B.87) 

for a certain, unique dj G ^o- Hence, tr (fbj = 0 for every j, so, by continuity, tr i^a = 0. 
Since a was arbitrary, ip must be in the annihilator of the kernel; hence, it is in W(T:’; /x; H). 

The subset diag*L°°(A’; /x) ® Jh is itself the commutant of the C*-algebra given by the 
closure in norm of diag*L“(A’; /x)(8)i3(H), which is (since simple “functions” are norm-dense in 
L°°{A] /x)) given by {diag(S)lB(H))*L°° (H’; hi H(H)). Since W(T’; /x; H) is the bicommutant of 


a C'*-algebra, it is is a von Neumann algebra by von Neumann’s bicommutant theorem 110 

□ 


Comment Using the isometric bijections in B5.20, this induces an algebraic structure on 
(df;/x; 5i(H)). The identity operator in W(A’;/x;H) corresponds to the linear functional 
$ which agrees with the norm on elements in the positive cone {X; /x; ©(H))"*", 


= / ii p dp, 


(B. 88 ) 


' X 


where the trace acts pointwise. 


Proposition B5.23 The subspace ker (diag®/^( h)) is invariant under either left or right 
product with elements of >V(d:’; /x; H). 


Proof Take any a in the kernel, (p in >V(A’;/x;H), /x-measurable subset E G X, and 
-0, X G H. Then, for Pe = diag* Ig the orthogonal projector in B{L‘^{X] /x)) to the subspace 
of “functions” zero almost everywhere outside of E, 

/ (((diag 0 /B(H))a7’) x) dp = tr {Pe 0 ipX*)(^T = H “V = 0 (B.89) 

Je 

where a' = {Pe 0 is still in the kernel (as is readily seen by using singular value 

decomposition on a as in the dehnition of diag in B5.17). Since E,f)^x were arbitrary, it 
must be that (diag0/B(H))a95 = 0; hence, ap is still in the kernel. Similarly, using the cyclic 
property of the trace. 


/ (((diag 0 lB{H))pa) x/>, x) dp = tr {Pe 0 = H «'V = 0 (B.90) 

JE 

where a" = a{PE 0 'fx*) is still in the kernel. Since E, fj, x were arbitrary, it must be that 
(diag 0 lB{H))pa = 0 ; hence, pa is still in the kernel. □ 

^®The bicommutant is the commutant of the commutant. 







APPENDIX B. PROPOSITIONS FOR OPTION I 


138 


Alternate proof Note W(A’; /i; H) separates the elements of (A"; /i) ® H) /ker (diag® 

Jg(H))- To see this, take any a G iSi ® H). Then, by the dehnition of Bochner in- 

tegrability, (diag ® Ib{H))o: can be arbitrarily well approximated in L^(A’;p;5^(H))-norm 
by simple, 5^(H)-valued functions, where {Bj} is a hnite, disjoint collection of 

/i-measurable subsets of A. Use singular value decomposition to write each pj as UjTjV*, 
where Sj is a trace-class, diagonal operator in (or if dim H is hnite) and Uj, Vj are 

partial isometries from (or if dim H is hnite) to H. Then, f3 = ^^.(diag*!^^)01^17* G 

W(A; p; H) has the property that 


tr /3(diag(g)/B(H)) -E ^BjPj 

3 


3 


(B.91) 


Hence, by taking the approximation sufficiently close, there is some such j3 such that tr (3a 
is arbitrarily close to ||a||. (Note the Hahn-Banach theorem gives this result more easily, but 
it relies on the Axiom of Choice.) 

Using this, ker (diag 0 Ib{h)) can be characterized as 

[a G ® H) |tr /3a = 0 for all /3 G W(A;/i;H)} (B.92) 

Take any a G ker (diag 0 Ib{v\))- For any hxed [3 G >V(A; p; H), we then have tr 7 / 3 a = 0 
for all 7 G W(A; /i; H); hence, (3a G ker (diag 0 7b{h))- Also, 0 = tr (3'ya = tr '^a(3 for all 
7 G W(A; /i; H); hence, a/3 G ker (diag 0 Ib{h))- El 


Comment Now given any bounded, linear map L : —)■ L^(3^; z/; T>(J)), 

we can extend it to L : L^(3^; z/; iSi(J)) using B5.15. We then have the 

following two linear maps: 

L : Si (A; /i) 0 H) /ker (diag 0 Ib{H)) ( 3 ^; z/) 0 J) /ker (diag 0 /b(j)) (B.93) 

L* : W(3^;z/;J) ^ W(A;/x;H) (B.94) 

given by 

L = (diagi2(;t^.^) 0 /b(j))“^ oLo (diagi2(;t;/.) ® ^b(h)) (B.95) 

L* = (diagi2(;i..^) 0 /b(h))* o L* o (diag^2(;t';M) (B.96) 

Proposition B5.24 Given a bounded, completely-positive, linear map L : L^(A; p; ©(H))"^ 
L^{y] z/; T>(J)) + , the resulting L, L* are completely-positive in the sense of Stinespring’s the¬ 
orem 

^tr alaiL(i>ii>l) > 0 ^ (l’> 0 

i,k i,k 

for all hnite collections {a^} C W(33; z/; J) and {'^Zc} C T^(A; p) 0 H. 
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Proof The two expressions are related by duality, do it is only necessary to prove the left- 
hand one. The map L is the composition of completely-positive maps; hence, it is completely- 
positive. Therefore, for any hnite whole number n, L ® Im„ is completely positive, so for 
any (3 e W(3^; J ® €"■) and cp G L‘^{X]ij,) ® H 0 €"■, 

ti (B.97) 

For {oj} an orthonormal basis for C”, write (3 as Ylll ^ hV(3^;z/;J) and ip 
as i’j^j V'i G p) ® H. Then we have 

n 

^ tr > 0 (B.98) 

i^j^k=l 

If the aij are chosen to be independent of j, this gives the desired result. □ 

Comment Following the proof of Stinespring’s theorem, dehne the positive, bilinear form 
(•, •)stinespring ou >V(3^; P] J) 0 (g) H by 

® ® (^fc\ = ^ l3l^jL{'>PjTk) (B.99) 

\ j k / stinespring 

Let Af be the null subspace {a G W(3^; u] J) (g T^(A’; p) 0 H|(a, a) stinespring = 0}. Take any 
a e Af and f3 G W(3^;z/; J) (g) LA{X]ij) g) H; by considering {a -|- t/3, a -f- t/?) stinespring and 
(a -h itf3, a + ^t/3)Stinespring for small, real t, from the positivity of (•, ■)stinespring, h is evident 
(a, /3) Stinespring = 0. Then (•, •)stinespring is an inner product on >V(3^; u] J) ®L‘^{X]p)® H/A/"; 
this space can be completed to give a Hilbert space K. 

From the proof of Stinespring’s theorem, Af is invariant under the left action of /3 g 
for any (3 G W(3^;n; J). This then induces a representation of the von Neumann 
algebra, C, : W(3^; w, J) —?■ i3(K). The representation is readily seen to be a *-representatioi]p^ 

Proposition B5.25 The representation ( has operator norm less than or equal to one 
and is weak* continuous with pre-adjoint C* : ‘5i(K) Si {If {X; p) ® H) /ker (diagg /b(h)) 
being isometric on the positive cone. 

Proof By the dehnition of operator norm and the Cauchy-Schwartz inequality, there is the 
well-known formula for the operator norm in B{S) for any Hilbert space M, 

||/?||op = sup (/5x,y) 

x,y G M 
l|x|| < i,liy|| < 1 

representation C of a C^-algebra is a *-representation if C(o*) = C(q^)* for s-ny a. 


(B.lOO) 
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Using this and the norm-density of VV(3^; z/; J) ® ® H/A/” in K, we have, for any 

aeW{y-v-I), ||C(«)||op is given by 


with unit length using the Stinespring-norm 

(B.lOl) 

Absorbing a into a redefinition of each /3j, by the Cauchy-Schwartz inequality, this is less 
than or equal to ||Q;||op- Hence, ( has operator-norm less than or equal to one. 

To see ( is weak* continuous, take any (3 G 14^(3^; z/; J) and rank-one ip^p* G iSi(K), where 
both tjj and 99 are in the norm-dense subset W(A’; z/; J) ® H/A/”. Let 'ip E 'ip he 

written ® 0 and (p E (p he written 7 ^ 0 Tjj. Then 

tr C{l3)p)p* = l|3(DIL^x■,^p(^H'^P,p) = ^ (L*{^lPaj)^j,rik) 

\ / Stinespring ' ^ \ / 

= 'iWajU^jVD = tr iSCipjp*) (B.102) 

j,k 

where C,*{i’T*) = Note C{'ip'ip*) is positive since, for any {3 E W( 3 t; z/; J), 

(3* 13a,L{i,Ck>l>h (B.103) 

j,k 

using the cyclic property of the trace and B5.23. Then, 

||C(^^*)||=trC(^^*) = ||^ir (B.104) 

Hence, C* is bounded with operator norm less than or equal to two on rank-one operators 
with vectors in W(3^; z/; J) ® p)® H/A/”. Using A1.3, by continuity it extends to rank- 

one vectors with the same bound. By linearity, it extends to finite rank operators. Using the 
spectral theorem, it is still isometric on the positive cone; hence, it has operator norm less 
than or equal to two. By A1.3, it extends to all operators in 5i(K) while still being isometric 
on the positive cone and with operator norm less than or equal to two. Since C = C) C* 
actually has operator norm less than or equal to one. □ 

Proposition B5.26 There is a Hilbert space M and a partial isometry i? : K —)■ (A; /i)® 

H® M such that the representation (p is given by C(«) = E*[a®Iu)E for any a E W(A’; z/; J). 
The partial isometry E is such that the orthogonal projector EE* commutes with all 
If all the Hilbert spaces (A; /i), H, (3^; z/), and J have finite dimension, then M can be 
taken to be J ® dimL^ ® H. 
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Proof From a result by Sakai 109 , since C is a weak*-continuous, *-representatio]:]p^by the 
preceding proposition, ( is given by the composition of an amplification with an induction-. 
there exists a Hilbert space M and a bounded, linear map E : K (3^j ® JM such 

that C(<a) = E*{a ® Im)E, EE* is an orthogonal projector commuting with all {a ® /m}, 
EE*E = E, and E*EE* = E*. Then C* is given by, for any [3 e 5i(K), C*/? = tr mE/3E*, 
where tr m is the partial trace over M. Since, by the preceding proposition, is an isometry 
on the positive cone, it must be that E*E = Jk, so F is a partial isometry. Unfortunately, 
the proof given by Sakai is non-constructive; the only limitation on the dimension of M is 
that it need not be larger than dimK. 

If all the Hilbert spaces have hnite dimension, we can be far more concrete in the con¬ 
struction. Let M = J ® ® H. Since the Hilbert spaces have hnite dimen¬ 

sion, >V(3^; z/; J) is isomorphic to L°° (3^; u) ® H(J), which is isomorphic to the Hilbert space 

(3^; z/) (g) J 0 J using L°° (3^; z/) = (3^; z/) ancQ H(J) = 303. Then, using this isomor¬ 

phism, M maps to a subspace Af' of the Hilbert space (3^; z^) 0 3 (8) M. Also, then K is 
isomorphic to (3^; z^) ® 3 0 M /Af' = Af'~^ with no need for completion. Let U : K —>■ Af' be 
this isomorphism. Then the operators a ® IL^{x■,^i)^H acting on >V(3^; z/; 3) 0 (A; p) 0 H 

give rise to a representation a ® Jm on (3^; z/) ® 3 ® M. From the proof to Stinespring’s 
theorem, Af' is invariant under all the {a® Jm}, so the orthogonal projector commutes 
with all the {a ® /m}. 

Any point in the space (3^; z/) ® 3 ® M can be uniquely written as n -|- m for n G Af' 
and m G Af'^. Now put a new inner-product on the space, (•, ^new, dehned by 


{jn 71^ Tfl Ti )new (-^ (^); ^ ) ) Stinespring ^ ) 


(B.105) 


(This is the step that requires all the spaces to be hnite-the use of hniteness in the preceding 
steps is not essential.) With respect to this new inner product, A is a partial isometry. Let 
E* be the adjoint map with respect to the new inner product. Then the representation ( on 
K is provided by, for any fc G K, 

C{a){k) = (U-i o P^,x) (« ® lM)E{k) = {E*{a ® lM)E){k) (B.106) 

Hence, C(a) = E*{a ® Iu)E. □ 


Proposition B5.27 Any completely-positive map L G CP(L^(A’;/i; P(H)), L^(3^; z/; 1^(3))) 
can be expressed as 

Lp = (diag^ 2 (y.^) ® /b(j) ® tr M)V"((diagi 2 (^.^) ® Ib{h))~^ p)V* 

for some Hilbert space M and some linear operator V G (A; p) ® H, (3^; z/) ® 3 ® M). 
If L is an isometry on the positive cone, then U is a partial isometry. If all the Hilbert 
spaces L^(A;/i), H, L^(3^;z/), and 3 have hnite dimension, then M can be taken to be 
3 ® dimL^ (A;p,) ® H. 

^^Termed a W*-representation by Sakai. 

^^Using Mn = C" 0 C", say by using the Vec operation that stacks columns. 
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Proof By B.16, L extends uniquely to a map in G CV{L^{X] p; iSi(H)), L^{y', v] i5i(J)). By 
B.24, the resulting L* (defined in the comment preceding B.24) is completely-positive in 
the sense of Stinespring’s theorem. Also, the von Neumann algebra VV(3^; z/; J) (see B5.21, 
B5.22) of course contains the identity. Hence, by Stinespring’s theorem, L* can be expressed 
as, for any a G >V(3^; u] J), L*a = W*C{a)W for some W G B{L‘^ (A"; p) ® H, K) (where the 
Hilbert space K and the representation ( are defined in the comment preceding B.25). By 
the preceding proposition, C('^) = E*{a 0 Im)E for some Hilbert space M and some partial 
isometry E : K, (3^; u) ® J). Also, if all the Hilbert spaces (A; p), H, (3^; z/), and J 
have finite dimension, then M can be taken to be J ® dimL^ (A; p) 0 H. 

Let V = EW; then L*a = V*(a^lM)V, so L/3 = trM(V/3V*) for any /3 G Si(L^ (A;/i) 0 
H)/ker (diagi 2 (;t..^) ® Ib(h))- (As a check, if 7 G ker (diag^ 2 (;t..^) ® Ib(h)), then trM(A 7 'B*) is 
indeed in ker (cliag 2 ^ 2 ( 3 ;.y) ®-^z?(j)) by the alternate proof to B5.23 since tr (a ig) lM)V'yV* = 0 
for every a G W(3^; z/; J).) Writing L in terms of L then gives the desired form for L. 

If L is isometric on the positive cone, then L is isometric on the positive cone; hence, for 
any (3 G (A;/i) 0 H)+, 

tr /3 = tr L/3 = tr VI3V* = tr (3V*V (B.107) 

Therefore, it must be that V*V = It 3 {L^{x■,^J.)^H)', hence, it is a partial isometry. □ 

Proposition B5.28 Any linear, positive map L G i3(L^(A;/i),P(L^(A;p))) satisfying 
diag o L = It 3 {L^{x-,iJ,)) has image in the diagonal operators (see B4.3). 

Proof By the definition of the diagonal extraction map, diag, we have 

\b ■ diagp = diag^PEpPs) (B.108) 

for any p-measurable H G A with finite mw-measure, where Pb is the orthogonal projector 
to the subspace of “functions” zero almost everywhere outside B. Take any “function” / G 
L\A:;p). Then, 

diag PbL(1^b/)Pb = Is ■ diag o L(1^b/ = IbI^bI = 0 (B.109) 

The operator PbL(1....s/)Pb is clearly positive; by the isometry of diag on the positive cone 
(see B5.18), it must be the zero operator. Hence, (L(Ws/)lc; i-c) = 0 for any p-measurable 
C CB 

Now take any p-measurable, finite p-measure B' disjoint from B. By positivity, for any 
a G M, 

{L{l^Bf){(iIc — Ib'),ciIc — Ifi') > 0 (B.llO) 

For this to hold for all a, no matter how large, it must be that 


{L{l^Bf)^c, i-B') — {L{l^Bf)^B', Ic) — 0 


(B.lll) 
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Similarly, since B' B and C is disjoint from ~ B, 


1b') — 0 (B.112) 

Then 

0 = (L(1^b/)1c, 1^/) + (L(1b/)1c; 1b') = {L{f)lc, Ib') (B.113) 

Taking C = B, we get (L(/)ls, 1^/) = 0. Since B,B' were arbitrary, Lf is diagonal. □ 

B. 6 Vector measures 

Notation The spaces of finite-norm, vector measures on the given set with the given a- 
algebra will be denoted A^(T’;T;A), B),. ... These are Banach spaces using the 

total variation norm (give reference). The subset of these that is atomic will be denoted 
AiX;S;A),.... 

Proposition B6.1 The subset A(A;T; B) is a closed, linear subspace of the Banach space 
M(A; T; B) (hence, it is itself a Banach space). 

Proof The subset A(A;T; B) is clearly a linear subspace. To show it is closed, take any 
ju in the complement of A{X\E\ B). Suppose that for any e > 0 there were some countable 

collection of atoms {A^} C S such that ||/i \ IJ^. || < e. Then the countable union 

B= y yy (B,ii4) 

£e{i,2-i,2-3,...} j 

would satisfy \\fi {X \ i?) || = 0, which contradicts n being in the complement. Hence, there is 
some e > 0 such that, for any countable collection of atoms {Aj} C S, ||/i (^X \ IJ^. Aj^ || > e. 

Consequently, the distance from fj, to A{X\E\ B) is greater than e. Since fi was arbitrary, 
the complement of A(X;E; B) is open; hence, A(X;E; B) is closed. □ 

Proposition B6.2 If the Banach spaces A, B, and C are such that A® B is norm-dense in 

C, then A(X;E;A) 0A(y;X; B) is norm-dense in A(X x y;Q;C), where Q = a(E x X) is 
the (T-algebra generated by the rectangular subsets E x X. 

Proof The atoms of Q are in the rectangular subsets Ex X. Therefore, any vector measure 
in A{X xy^Q-^C) can be arbitrarily well-approximated in norm by vector measures in A{X x 
y-,Q-,C) of the form x ^Fj for hnite collections of atoms {Ej} C E and {Fj} C X and 

a hnite collection {cj} C C. By assumption, each c G C can be arbitrarily well-approximated 
in norm by elements in A(8)B. Hence, A{X-, E] A)^A{y', X] B) is norm-dense in A{X x y-,Q-,c). 
□ 
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Appendix C 

Propositions for option II 


Notation All topological spaces A, 3^, ... are compact and Hausdorff. C(A), C{y),... 
are the spaces of real-valued, continuous functions, which are Banach space employing the 
maximum norm. All partitions of unity will be assumed composed of continuous functions. 
Hilbert spaces, denoted H, J,. .. are complete, sesquilinear inner-product spaces, with no 
restriction as to their dimension or separability. /C(H), /C(J), ... denote the spaces of com- 
paclQ self-adjoint operators on the specihed Hilbert space; these are Banach spaces using 
the operator norm. C(A; /C(H)), C(3^; /C(J))), ... are the spaces of compact-operator-valued, 
continuous functions, which are Banach spaces employing the norm given by hrst applying 
the operator norm on the operators pointwise, then the maximum norm over the space. 


C.l Real-valued, continuous functions 


Comment The following proposition strengthens the well-known result, which is a special 
case of a result by Grothendieck 17 , that C(A x 3^) = C(A)®C(3^), where ® indicates 


completion in the injective norm[^ 


Proposition Cl.l The hnite-nonnegative-tensor-ranlj^ continuous functions are dense in 
the maximum norm topology for positive, continuous functions on A x 3^. 


Proof Let / be any positive, continuous function on A x 3^ and take any e > 0. Since / is 
continuous, for each {x,y) G A there are open subsets C A and C y such that 

f{U{x,y) X V^x,y)) c {f{x,y) - ^,f{x,y) + I). Since A X 3^ is necessarily compact, there is a 
hnite subcover, {U(^x,y)j x For each a; G A, dehne Ux by the intersection over all 

^An operator is compact if the image of a bounded sequence necessarily contains a convergent subse¬ 
quence. 

^The injective norm on A (g) B is the norm induced by its being a subspace of Bilinear(A*, B*), ||c||v = 
supc(y, b)) where the supremum is taken over ip in the unit ball of A* and in the unit ball of B*. 

^Using only positive real scalars. 
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U{x,y)j containing x. Similarly, for each y E y, define Vy by the intersection over all V(^x,y)j 
containing y. Since X and y are compact, there are hnite subcovers, and { 14 ^ }^^, 

where each f/j.. x 14 ^, is a subset of one of the U(x,y)j x V{x,y)j- Since compact, Hausdorff sets 
are normal, there are hnite partitions of unity, and dominated by 

and respectively 


103 


. Then 


I m 


EE f{xi,yi)(t)ii)k 


(C.l) 


i=l k=l 


is a hnite-nonnegative-tensor-rank continuous function that is everywhere on X x y within 
e of /. □ 


C.2 Maps on real-valued, continuous functions 

Corollary C2.1 Positive maps L G B{C{y),C{X))~^ are completely-positiv^ 

Proof Use Cl.l and the positivity of both maps L and Ib(C{z))- D 

Corollary C2.2 Maps L E B{C{Z),C{X)) and K E B{C{W),C{y)) satisfy {K Z L)*{y x 
u) = K*fi X L*i> for any Radon measures y on X and u on 3^. 

Proof By the preceding proposition, it is sufficient to demonstrate that for any hnite 
collections {fj} C C{Z) and {gj} C C(W), 

[ fj® 9j d{K 0 L)4/i X z/) = ^ f fj 0 gj d{K* y x LV) (C.2) 


iZxW 


iZxW 


but this follows from Tonelli’s theorem 96 , which is applicable since y, v are necessarily 
hnite measures, 

y] / fj® 9j d{K Z L)* {y X u) = Y] / ^4 ® L9j d{y x v) = Y\ / dCfj dy / Lgj du 

j Jzxw j Jxxy j Jx Jy 

(C.3) 

= V / 4 dK*y [ g^ dL*u = V / fjZ gj d{K*y x LV) □ 

. Jz Jw , JZxW 


Comment For any positive map L E B{C{y),C{X))~^ , clearly ||T||op = ||T(l 3 ;)||max- Hence, 
the cone of positive maps is clearly normal. 


map L is completely-positive if L (g) I]s{C(z)) is positive for every Z. 







APPENDIX C. PROPOSITIONS FOR OPTION II 


146 


Proposition C2.3 For any L G \\L\\op = ||^*(5x)||- 


Proof By definition of the operator norm, 


||L||op = sup ||L/||i„ax = sup |(L/)(a;)| 

/ec(y),||/|Uax<i 


/ e C(3^) 

||/||max<l 

X E X 


= sup 


Lfd5, 




f e C(3^) 

ll/llmax < 1 )>=SUp 

X E X 


fdL*{5, 




sup \\L (5x) II total variation D 

x(^X 


(C.4) 


/ e C(3^) 

ll/llmax < 1 

X E X 


Corollary C2.4 Maps L E B{C{Z),C{X)) and K E B{C{yV),C{y)) satisfy ||iF ®L||op = 

i|A'ii„piii|i„p. 


Proof Using the preceding proposition, 

\\KZL\\ 

op ^ sup ||(iF (g) L)*((5( 

a;, 3 /)) 11 total variation 

ix,y)eXxy 

Since S(x,y) = x 6y, from C2.3, this is equal to 

sup \\K (5x) X L (5y) IItotal variation 
{x,y)£Xxy 

By B1.5, this is equal to 

sup ||/f*(5x)||total sup ||F*(5y)||total 

variation 2 /sy variation 

which, using the preceding proposition again, is equal to ||iF||op||-h||op. □ 
Proposition C2.5 B(C(y),C(X)) is not in general a vector lattice. 


(C.5) 


(C.6) 


(C.7) 


Counterexample 

to be 


Take X = y = [—1,1] with the usual topology. Take L E 13(C([—1,1])) 


Lf{x) 


0 if a; < 0 

f{x) — f{—x) if a; > 0 


(C.8) 


Then L V 0 should be 


(LV0)/(a;) 


0 if a; < 0 

f{x) if a; > 0 


(C.9) 


but this sends some continuous functions to discontinuous ones. □ 
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Proposition C2.6 B{C{y),C{A)) is not in general directed-complete. 


Counterexample Take X = y = [—1,1] with the usual topology. Take Lj G i3(C([0,1])) 
to be 

Jo if X < 0 


Ljf{x) = 


if X > 0 


Then sup^ Lj should be 


(LV0)/(x) = 


0 if X < 0 


/(x) if X > 0 

but this sends some continuous functions to discontinuous ones. □ 


Proposition C2.7 For any positive L G B{C{y),C{X))~^ , there is a transition function 
r(-|-) : Opeuy x X, which is: (i) an additive set function on the open subsets of y; (ii) 
pointwise countably additive on the open subsets of 3^; and (Hi) lower-semi-continuous for 
any fixed open subset O C 3^-such that 

(L»(0)= f r(0|x)d/i(x) 

Jx&X 

for any open O C 3^ and Radon measure /i on T. 


Proof Let O be any open subset of 3^. Then, by the dehnition of Radon measures for open 


subsets in the proof of the Riesz-Markov theorem 78 


(L*/i)(0) = sup / / dL*n = sup / Lf dfi 

f€C{y) Jy f€C{y) Jx 

0<f<lo 


0 < / < lo 
sup 

f G C{y) JxGX 
0 < / < lo 


Lf d6x{x') I (i/i(x) = sup 
r:’GX J / e C(; 


fGCiy) JxGX \Jy 
0 < / < lo 


Then for any £ > 0, there is some continuous / between zero and Iq such that 


(L»(0)> J ^^\^J^fd{L*6x)j dMx)-- 


/ d{L*6x)j d/jifx) 
(C.IO) 

(C.ll) 


by 


Now consider the sequence of continuous functions {gj), each between zero and lo, given 


9j{y) = 


if dist(j/, ~ O) > 2 ^ 
2^ ■ dist(?/, ~ O) if dist(j/, ~ O) < 2~^ 


(C.12) 
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for j G Z"*" and y G y. Since y is compact and metric, / is nniformly continnons 104 59 , 
so there is some 5 snch that 


/(Ball(2/,<5))C /(y)- 




. /(a) + 


2||-h||op/i(df) 


(C.13) 


for any y G 3^.For j snfficiently large that 2 ^ < S, then gj is greater than or eqnal to / 
everywhere except for the set {y G 3^|dist(?/, ~ O) < 2~^}, on which it cannot be smaller by 
than 22 \\lImx ^ hence, 


/ diL*6,) dy{x)- / / / d{L*6,) ) dyix) < 

<x&x \Jy / Jx&x \Jy 

Therefore, 


(L»(0) > / / gjd{L*6^)] dyix) - e 

Jx&x \Jy 

Since e was arbitrary, it must be that 


{L*y){0) = lim / I lim / g^ d{L*6x) dy{x) 


(C.14) 

(C.15) 

(C.16) 


which is equal to 

[ ( lim [ gj d{L*6x)] dy{x) (C-17) 

Jx£X Jy J 

by the Dominated Convergence theorem [^. Using the Dominated Convergence theorem 
again, we have 


Also, 


(L»(0) = / lo diL*6x) dyix) = / (L*<5,)(0) dyix) 

Jx&x \Jy / Jx&x 


{L*Sx){0)= lim / gjd{L*6x)= \im {Lgj){x) 
Jy 


(C.18) 


(C.19) 


is measurable by (as the supremum over a sequence of increasing, continuous functions, 
it is actually clearly lower-semi-continuous). Hence, r(0|a;) = {L*Sx){0) has the desired 
properties. □ 


Alternate proof Let O C A’ be open. Dehne t{0\-) by 


/ 


t{0\x) = 


\ 


V 


f e C{y) 

\ 0 < / < lo 


ix 


(C.20) 
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Then ^(Ol-) is lower-semi-continuous, since, for any a G M, 


r(0|-) ^((a,cx)))= IJ (L/) ^((a,cx))) 


(C.21) 


/ e c{y) 

0 < / < lo 


which is the union of open sets; hence, it is open. Also, for any Radon measure /r, by inner 
regularity, 

(L»(0) = sup {L*n){K) (C.22) 


compact KdO 


By Urysohn’s lemma 105 , this is equal to 


sup / f{y)d{L ^T){y) = sup / {Lf){x) dy{x) (C.23) 

/ G c{y) Jy&y f £ c{y) Jx^x 

0</<lo 0</<lo 

which is less than or equal to t{0\x) dfi{x), which is less than or equal to 


( 


\ 


V 


f e c{y) 
V 0 < / < lo 


(/i) 


/ 


(C.24) 


which exists since dual spaces are necessarily directed-complete. This is in turn less than or 
equal to {L*y){0)] hence, {L*y){0) = J^^^t{0\x) dyi{x). □ 

The bounded, Borel functions on y, ^{y), form a Banach space using the supremum 
norm. The space B{y), acting on Radon measures via integration, can be considered a 
subspace of the linear functionals C{y)**. Note the preceding shows t{0\x) = {L*Sx:){0) 
and that L**lo = 't{0\-) is a function in B{X) rather than just a general pseudo-function in 


C(A’)” 113 


Proposition C2.8 For any positive L G B{C{y),C{A))^ , if 3^ is metric there is a transition 
function (see A3.1) 'r(-|-) : Borely x A such that 

{L*ii){B) = f t{B\x) djj,{x) 

Jx£X 

for any Borel subset B Gy and hnite Radon measure y on A. 


Proof Note we cannot just immediately begin with t{B\x) = Ldx{B) for a general Borel 
subset Bey because it is not evident that such a construction is measurable in x. Instead, 


we must proceed step-by-step in an inductive argument. Following Hausdorff 26 , let Go 


be the collection of the open subsets of 3^. For each ordinal a, let G^ be the collection 
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of subsets of y that are the countable intersection of subsets from the various collections 
for ordinals /3 < a if a is odd and that are the countable union of subsets from the 
various collections Gp for ordinals /? < a if a is even (where all limit ordinals-those without 
a predecessor-are taken even). For clarihcation, using the standard notation 106 , Gi = G 5 , 
G 2 = Gsa, and so on for the hnite ordinals. 

Fix any ordinal a and suppose that for all subsets B in all collections G^ for f3 < a 
we have the following property: there is a Borel measurable such that {L*fi){B) = 

t{B\x) dfi{x) for any Radon measure fi on X. Take any G G Ga- If a is even, we 
have a sequence {Bj) of subsets from the various Gjs with (3 < a such that G = [J^- Bj 
Let r(G|-) = \JjT{Bj 
Convergence theorem 


75 


Then 'r(GI-) is measurable (see j^) and, by the Dominated 
(L*/i)(G) = ^^^p^'t{G\x) dii{x) for any Radon measure fi on 
X] hence, since C was arbitrary, G^ has the property. Similarly, if a is odd, we have 
a sequence {Bj) of subsets from the various Gjs with (3 < a such that G = Let 

r(G|-) = l\-T{Bj\-). Then 'r(GI-) is measurable and, by the dominated convergence theo¬ 
rem, {L* fi){G) = t{G\x) dfi{x) for any Radon measure fi on X] hence, G^ also has the 
property. However, by the above proposition C2.7, Gq has the property, so by transhnite 
induction 27 (Ml all the Ga have the property. 

Since y is metric, its closed subsets are also in Gi = Gs (^; hence, following Kura- 
towski 43 , the Borel subsets of y are in the union IJ^ Ga- Therefore, the Borel subsets also 


have the property. □ 


Comment The preceding proof does not depend on the axiom of choice since the union 
only needs to be taken up to the ordinal number for the minimal uncountable well-ordered 
set 


43 , whose existence does not depend on the axiom of choice 55 


Corollary C2.9 For any positive L G B{C{y),C{X))~^ , if 3^ is metric then L** acting on 
Biy) C C{y)** has image in B{X) C C{X)**. 


Proof Use the density of simple functions in B{y) together with the above propositions 

C2.8 and A1.3. 


C.3 Compact-operator-valued, continuous functions 

Proposition C3.1 C{X; /C(H)) ® C(3^; /C(J)) is norm dense within C{X x 3^; /C(H (g) J)). 

Proof Finite-rank operators are norm-dense among compact operators, so proceed as in 
the hrst part of the proof for B4.1. Then following the argument in the proof of Cl.l 
involving partitions-of-unity gives the desired result. □ 
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C.4 Operator inequalities 

Proposition C4.1 For any operators /C(H), 

lll<^l - l^lllop < y^ll<^-^llop||<^ + ^||op 

where |x| = a/P- 

Proof By the properties of compact operators, | v^| — I'd is compact. By the spectral theorem 
for compact operators, there is a unit-length ip ^ H which is an eigenvector ip of \^p\ — |.^| 
and with corresponding eigenvalue A equal to |||v9| — |dllop in magnitude. Then, using the 
triangle inequality, 

lllv^l - Idlli = |A((|^| - ld)^,^)l < |A((|^| + ld)^,^)l (C.25) 

= ^ (d^i + Id) (1^1 - id)^,^) + ^ (d^i - Id) (1^1 + id)^,^) 

= ^,^>1 < ii<d^ -^^iiop = ^(<d+d(<^-d + ^(<d-d(<d+d 

op 

< - «)iiop+^ 11^ ■ ^“‘">11^+° 


The second needed inequality, which involves subsets of direct products of spaces of 
operators, is lengthy to state, although it has a very short proof. For real numbers a = 
(ai,..., ttn) in the simplex {x G [0, Ij^^lxi H-h = 1}, let A„(a, H) C /C(H)^’^ be the set 


(C.26) 


Let B(i3(H)) denote the closed, unit ball (using the operator norm) in i3(H). By the tri¬ 
angle inequality, B(f3(H))^"' C A„(a, H) for any allowed choice of a. Now consider the set 
C„(£,H) c/C(H)^" given by 


n 

si) 

i=i 

op ) 


|(0i,---,0n) e{B e /C(H)''” max||0j - dfcidp < e 

for e > 0. 

Dehne a distance from A G /C(H)^"^ to a subset F C /C(H)^"' by 


(C.27) 


dist (A, F) = m^max \\Aj — Ej\\op 


(C.28) 


Using this to dehne a Hausdorff distance between subsets then gives: 
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Proposition C4.2 The Hausdorff distance between An(a, H) fl Cn{s, H) and ]B(i3(H))^'^ fl 
is bounded by e. 

Proof Take any allowed a and any 0 G An{a, H)nC'„(e, H). Then, by the triangle inequality, 
for any j G {l,2 ,...,n}. 




k=l 


n 


op 


— II0j 0fc|lop 

k=l 


< e 


(C.29) 


However, by the dehnition of H„(a, H), Yllt=i^k(t>k is in B(i3(H)). □ 

Note the compactness of the operators was not used in the preceding proof so the proposition 
holds for the corresponding subsets of self-adjoint operators in H(H), although we will not 
require that generalization. 

For the following proposition, let {Aj} be any hnite collection of positive, Hermitian 
n X n-matrices and let P be any orthogonal projector. 


Proposition C4.3 There is a constant cq, independent of P, n, and r, such that 


||.4, - PAjPW trace ^ Cq 


7 = 1 


PAjP) 


7 = 1 


Proof Observe that for any Oi,..., C 1^ \ {0}, the quantity =r jg minimized over 

Xi,... ,Xm C 1^^ for hxed Xl^i when for each j, Xj = \aj\. Now consider the problem of 
minimizing the product tr D tr CD~^C* over strictly-positive, Hermitian, m x m-matrices 
D, given any n x m-matrix C. First take the case of n > m and C*C strictly positive. The 
minimization over D is the same as minimizing over its eigenvalues, Xi,... ,Xm C M"*", and 
eigenvectors, IF, with W unitary. For hxed value of W and the trace of Xi + ■ ■ ■ + 
using the preceding observation, this will occur for 


(xi H-h Xm)ej 

Cl + ■ ■ ■ + Cm 


(C.30) 


where Cj is the square root of the jth diagonal entry of W*C*CW. Then, tr D tr CD~^C* = 

(ci -I-Since squaring is monotonic for positive reals and the square root function is 

concave, this is minimized over all IF when IF^C^CIF is diagonal, so ei + - ■ ■+em = tr \/C*C. 

Therefore, tr D tr CD~^C* > ^tr \/C*c'^ . By the ability to embed matrices into larger 
matrices and the density of invertible matrices, this inequality actually holds for all n x m- 
matrices C. 
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By the triangle inequality, 
2 


0 c 

C* D 


< 


trace 


0 c 

C* 0 


+ 11/^1 


trace 


= (2tr VC^ + tr of (C.31) 


By the arithmetic-geometric mean inequality, this is less than or equal to 

8 ^tr VC*C^ + 2 (tr D)^ < 8 ^^tr VC*C^ + (tr D) 

which, by the above result, is less than or equal to 

8 ((tr CD-^C* +tT D) tr D) 


For any Hermitian matrix of the form 
sufficient) that B — CD~^C* is positive; ot 
{{B — CD~^C*)\^\) < 0, then the vector w = 


B C ' 
C* D 
aerwise, i: 


(C.32) 


(C.33) 


to be positive, it is necessary (and 
f there were some vector v such that 
would give 


V 

-D-^C*w 


B C 
C* D 


,^^ = {{B-CD-^C*)w,w) + {D[D-^C* Im]^,[D-^C* ] w) 

(C.34) 

= {{B -CD-^C*)w,w) < 0 


which is a contradiction. Therefore, for any positive, Hermitian matrix of the form 
with D strictly positive. 


B C 
C* D 


0 C 
C* D 


2 


trace 


< 8 ((tr B + tT D) tr D) 


(C.35) 


By the density of strictly-positive matrices among positive ones, this holds even if D is not 
strictly positive. Since any orthogonal projector can be brought into the form where there 
are ones along the upper-left diagonal and zeros everywhere else, this proves the case of 
r = 1, 

||H - PHPlItrace < Co\/tr {A - PAP)ti A (C.36) 

with Co = 2 \/2. 

The case of r > 1 then follows. Take any row vectors a, b,c C M"*" that are independent 
and whose projections to any of the subspace of dimension greater than two using diagonal 
projectors are still independent. Then, using Lagrange multipliers, the maximum value of 

xi, ... ,Xr > 0 for hxed value of the product (SI=i ^kXk) can 

only occur if all but two of the x^’s are zero. Even then, the maximum value can only be 
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attained for both x^-’s nonzero if it is also attained for one of them zero. Since the given 
conditions on a, b, c describe a dense subset, then it always the case that the maximizing 
value is attained when all the x/s are zero except for one. Then, since 


e;.i 11 - 4 , - PA,p\u,^, 
y(EI., (tr (. 4 , - PA,P))) (ELi f - 4 ,) 


(C.37) 


over all P, n, r, and {Aj}, is the same as 


^3 P-^3 trace 

sup — , 

\ (ELi (tr {Ak - PAkP))) (ELi ^i) 


(C.38) 


over all P, n, r, {xj} C M"*" U {0}, and {Aj}, it is the same as 


||7l-F/lF||trace 

sup — , 

^tr {A - PAP)tT A 


(C.39) 


over all F, n, and A, which is bounded by 2\/2 by the above. □ 


Remark Apparently cq 
replaced by 


2 is sufficient, since, by numerical calculation, (C.33) can be 


0 C 
C* D 


2 


trace 


< 4 ((tr CD-^C* + tr D) tr D) 


(C.40) 


where the bound is tight since equality is approached as e —)■ 0^ for D = e \/C*C. However, 
a proof of this tight bound is lacking. 


C.5 Maps on compact-operator-valued, continuous 
functions 

Proposition C5.1 Any positive map F G F (C(3^; /C(J)), C(A’; /C(H)))’'" satishes 

||F||op = sup ||F/|| = sup ||F(/Kly)|| (C.41) 

/ € C{y' K.{I)P finite-dimensional KcJ 

ii/ir<i 

Proof From the proof of C3.1, functions of the form gjcpj for hnite collections {4>j} C 
/C(J) and partition-of-unity {gj} are norm-dense in C(3^;/C(J)). Furthermore, for any e: > 0, 
it is possible to enforce the constraint that ||0j — 0fe|lop < if the support of gj intersects 
the support of gk while maintaining the density property. 
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Then, for any value of e > 0, ||L||op is equal to 


sup < 


i=i 


Now consider b{e) given by 


sup 


i=i 


partition-of-unity {gi,...,gn}, 

{0j}c/C(J),||0j-0fc||^p<e 

if support gj fl support gk ^ 0, 

ELi (I>j9j < 1 


partition-of-unity {gi,..., gn}, 

C /C(J),||0j-0fc||^p <e 

if support gj fl support gk ^ 0, 
UjWop < 1 


(C.42) 


(C.43) 


Note b{e) is a decreasing function of e, so its limit as £ —?• O’*" certainly exists. By C4.2, 


fe(^)<||^||op<fe(^)+^||^||op (C.44) 

so 

b{e) < l|i||„p < Pr (C,45) 

Hence, ||T||op is equal to lime_^o+ K^)- 

The additional constraint that all the {4>j} are positive can only reduce the supremum 
or leave it unchanged. On the other hand, since L is positive 


3 


sup sup V (L(^j0j)(a;)v,v) 

xeA’veH,||v||<i ““ 


(C.46) 


is certainly less than 




sup sup V (L(^j|0j|)(x)v,v) 

xeA’veH,||v||<i ““ 


(C.47) 


In addition, examining the constraints on {0^} in (C.43), || \(f)j\ ||^p = ||0j||op and, by C4.1, 
|||0il - |0fc||lop < -0fc|lopll</>i + 0fcllop ^ + 0fc|lop (C.48) 


By the triangle inequality and the other constraints, this is less than or equal to (so 
this new condition gives rise to a subset that necessarily includes the previous one), which 
can simply be replaced by e since e —)■ 0. Hence, the additional constraint that all the {0j} 
are positive can only reduce the supremum or leave it unchanged. Therefore, it must leave 
it unchanged. 
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However, now starting with sup f £ c{y-IC{J)A repeating the process would 

ii/ir<i 

lead to the same result, so it is only necessary to take the supremum over the positive cone. 
Finite-rank operators are norm-dense among compact operators, so it is only necessary to 
take the supremum over them. Any collection of hnite-rank operators live collectively on 
some hnite-dimensional subspace K C J. Since L is positive, the supremum for that particular 
K then occurs for the constant function with value Jk. □ 


Proposition C5.2 If L G H (C(3^;/C(J)), C(A’;/C(H))) is completely bounded, then, for any 
compact space Z and any Hilbert space K, \\L Z /||op < ||-h||matrix with I the identity map 
in i3(C(Z;/C(K))). 


Proof By the dehnition of operator norm and the dehnition of the tensor product of maps, 
\\L Z /||op is equal to 


sup |||(L (g)/)/|| 

By the preceding proof, this is the same as 


hnite-tensor-rank / G C(3^ x Z; /C(J 0 K) 
with II f II < 1 


(C.49) 


lim sup 


ZEE ^i,4^jkl9j') Z ^jklhk 

j=l k=l 1=1 


(C.50) 


m G {1,2,...}, 

partition-of-unity {gi, ..., gn}, partition-of-unity {hi, ..., hp}, 
{(j)ju} C }C{I),{Tjki} C /C(K), 

||X]/il(0iM ® Tjkl — (flj'k'l Z Tj'k'l)\\op < £ 

if support gj fl support gji ^ 0 and support hk fl support hk> ^ 0 
IE/=1 ^jkl TjklWop — ^ 

Writing 

n p m 

EEE 

j=i k=i 1=1 


(C.51) 


out as 


sup sup 

x(^X zdZ 


n p m 

EEE H4>jkigj){x) Z)Tjkihk{z) 


(C.52) 


then, since Z is compact, the maximum value is achieved for a certain x* (which depends on 
all the other quantities the supremum is taken over). Incorporating the value of hk{z^) into 
Tjkl, then the constraint: 

m 

^ ^ 4^jkl ® F/fcz 
1=1 


< 1 


nr^ 


(C.53) 
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becomes 


which implies 


^ ^ 4^jkl ® jkl 


l=l 


Ti hki^z^^ 


op 


k=l 


^ ^ 4^jkl ® pkl 


1=1 


< 1 


op 


which by the triangle inequality implies 

p m 


EE 4^jkl ® pkl 


k=l 1=1 


< 1 


op 


Similarly, the constraint: 




1=1 


< e 


op 


if support Qj n support gy ^ 0 and support hk fl support hk> ^ 0 


becomes 


^(0ifcZ ® Tiki — <Pj'kl ® 'Tj'kl) 


1=1 


< £hk{z^) 


op 


which implies 


E 

k=l 


if support Qj n support ^ 0 


^(0ifcZ ® Tjkl — <j)j'kl ® Tj'kl) 


1=1 


< e 


op 


if support gj fl support gj> 7 ^ 0 
which, by the triangle inequality, implies 

p m 


^(0iZcZ ® Tjkl — (j)j'kl ® Tj'kl) 


k=l 1=1 


< e 


op 


if support gj fl support gj' 7 ^ 0 


(C.54) 


(C.55) 


(C.56) 


(C.57) 


(C.58) 


(C.59) 


(C.60) 


Using the new Tjki together with the new, weaker constraints can only increase the value 
of the supremum. Furthermore, the sums over k and I can now be combined, yielding a 
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bound to ||L ® /||op given by 

m e {1,2,.. 

partition-of-unity {gi ,..., 

{0,4c/C(J),{r,fc}c/C(K), 

if support Qj n support gji ^ 0 
IIX]a :=1 ^jk ® 'Tj'fcllop — ^ 

Since the finite-rank operators are norm-dense among the compact operators, this bound 
is unchanged by requiring the Tjk to be hnite-rank. However, the hnite-rank operators 
collectively live on a hnite-dimensional subspace of K that is isomorphic to for some 
integer N < dim K. Incorporating this into the preceding expression for the bound, we have 

||^®/||op< sup ||L O Ja^^IIop □ (C.62) 

V<dim K 

Proposition C5.3 If the positive map L G H (C(3^;/C(J)), C(d:’;/C(H)))’'', compact space Z, 
and Hilbert space K are such that L®/ is positive, for / the identity map in B (C(Z; /C(K))), 
then \\L 0 J||op = \\L\\op. 

Proof If L 0 J is positive, then clearly L 0 J_a 4 „ is positive for every positive integer n less 
than or equal to dim K. By the preceding proposition, it is therefore only necessary to show 
that if L<S)lMn is positive, then ||L®/^„||op = ||f^i|op- However, if L<S)lMn is positive, then 

by C5.1, 

\\L®ImJ\op= sup ||(L(g)/^„)(/L®C"ly)|| (C.63) 

finite-dimensional LcJ 

= sup sup ||L(JLly)(a;) (g) Jnllop 

finite-dimensional LcJ x£X 

= sup sup ||L(/Lly)(a;)||op 

finite-dimensional LcJ xGA’ 

which is ilh^llop by C5.1. □ 

Corollary C5.4 The completely-positive maps 

H(C(3^;/C(J)),C(T’;/C(H))r 

are completely bounded. 

Corollary C5.5 The cone of completely-positive maps 

H(C(3^;/C(J)),C(T’;/C(H))r 

is normal in either the induced operator norm or the matrix norm. 
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Proposition C5.6 If L G 13 (C(3^;/C(J)), C(A:’;/C(H)))^^, then, for any compact space Z 
and any Hilbert space K, L ® / is positive, with / the identity map in B (C(Z; /C(K))). 

Proof Since L is completely positive, by C5.4 it is completely bounded. Hence, L Z I 
exists by C5.2. Furthermore, by C3.1 and A1.3, L 0 J is unique, so it is meaningful to 
speak of it being positive. 

Now suppose there were some compact space Z and some Hilbert space K such that 
L Z I were not positive. Then there would be some positive / G C{y Z ^;/C(J ® K))+ 
such that {L Z I)f is not positive. Since the cone C{A Z Z\ /C(H ® K))+ is norm-closed and 
L ® / is continuous, that implies there is a relatively open neighborhood of / in the cone 
C{y Z Z] /C(J ® K))+ whose image under L Z I does not intersect C{X Z Z; /C(H 0 K))+. 

Now approximating / as in the proof of C5.2 as Tjki ®Tjki Qj ® hk (but without the 
need for any e-constraints), one hnds that for this to occur there must be some n G {1, 2,.. .} 
for which L Z Im„ is not positive; however, that is a contradiction. □ 

Proposition C5.7 If either dimH or dimJ is finite and a positive map 
L G 13 (C(3^; /C(J)), C(A’; /C(H)))^ is such that LzIm^ is positive for m = minjdim H, dim J}, 
then L is completely positive. 

Proof Clearly, since L Z iMm is positive, so is L ® /_a 4 „ for all n < m. Now take n > m. 
L Z Imu 'will be positive if for every / G C{y; /C(J ® C”))+, y G H 0 C”, and x E A, 

{{{Lz iMn)f){x)y,y) >0 (C.64) 

It is enough to show this for / in a dense subset, so / can be restricted to the form ipjQj 
for {gj} a partition-of-unity and {(pj} a collection of compact operators. By the spectral 
theorem for compact operators, it is enough to show this for the <pj’s all rank one. Following 
the argument of B5.9 then gives the desired result. □ 

Proposition C5.8 The space of completely bounded maps, 

CH(C(3^;/C(J)),C(A;/C(H))) 
is a Banach space with respect to the matrix norm. 

Proof Same argument as for B5.10 □ 

Proposition C5.9 The subset of H (C(>’;/C(J)), C(T’;/C(H))) for which the tensor product 
with is positive for some hxed n G {1, 2,.. .} is closed in the weak topology. 
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Proof We will show the complement is open. Take such a map L that is not in the subset. 
By the argument in the proof for C5.7 and B5.9, that implies there are some x E A, 
partition-of-unity {gj}, hnite collections of vectors {wj^} C J and {v^} C H, and e > 0 such 
that 

m n 

EE {L(wjk<S)W*i gj){x)\i,\k) < -£ (C.65) 

j=l k,l=l 

Then, by the triangle inequality, all the maps in the weak neighborhood 


nn Af i^L; WjkW*,^ gj; 

j=l k,l=l 


'I ’Jx-j 


2 rPm 


(C. 66 ) 


m n 

+nnn-v(i ; (Wjv o w*; + Wji (g) w*^) gj-,\k ® j 

j=\k=lr<l 
m n 

+nnnA^( L- Wjk ® gj] (Vfc O v;; + 0 Vfc) 

j=lk=lr<l 
m 

+nnn-^ 

j=lk<l q<r 


{wjk ® w*i + Wji (g) w*;;,) gj] (Vg (g v; + V,. (g) vp 4; 


2 e 


will also fail to yield a positive tensor product with Imu- 


Corollary C5.10 The cone of completely positive maps is weakly closed in 
S(C(3^;/C(J)),C(T’;/C(H))). 


Comment See the comment following B5.12 concerning the use of the axiom of choice. 


Proposition C5.ll For any positive functional <h G C(A’;/C(H))*, there is some I1(H)+- 
valued. Radon vector measure g, such that 

= [ fdg 

J X 


Proof Since /C(H)* = T’(H), there is some p G such that = tr pcj) for any 

(j) G /C(H). Since hnite rank operators are dense in compact ones, p lives on a separable 
subspace of H; let { 0 ^} be an orthonormal basis for this subspace and {Pj) an increasing 
sequence of projectors onto the subspaces spanned by the hrst j basis vectors. Applying the 
Riesz-Markov theorem entry-wise, there is a sequence of Radon vector measures, (/ij), with 
each Pj taking values in V{PjH)^. Take j > k] then \\gj — g^W is given by 


sup < ^ \\Tj{El) - /ifc(^z)||trace 

(1=1 


r G {1, 2,...}, disjoint, Borel 
subsets {El ,..., En} 
with El = A 


(C.67) 
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Applying C4.3, - /ifc(-Ez)||trace is less than or equal to 


Co 




^tr - fik{Ei)) I ( Tj{Ei) 


, i=i 


. i=i 


(C.68) 


= Co ^JtT (/ij(A’) -/ifc(A’))tr/ij(A’) 


which is less than or equal to Cq y/tr {p — p. This goes to zero as fc —)■ cxd since 

the ^k{Aiys are truncations of p, which converge to p in trace norm, which can be seen 
either by using C4.3 again or by hrst demonstrating that the truncations converge in norm 
for rank-one p (which is readily shown); by the spectral theorem for compact operators, 
the truncations converge in trace norm for every p. Therefore, (/ij) is a Cauchy sequence. 
Since 'D(H)~''-valued vector measures are complete with the given norm 108 , the limit is the 
desired p. □ 

For the following proposition, restrict the Hilbert space H to be separable. Then there is 
some p G P(H)'*’ that is strictly positive. Let {ej} be an orthonormal basis for H composed 
of eigenvectors of p, with eigenvalues Xj in decreasing order. For any Radon measure p on 
A, by the preceding proposition there is the induced variation measure u = \L*{plxp)\ on 


3^, which is also Radon 108 . Then we have the following result: 


Proposition C5.12 For any positive L G H (C(3^;/C(J)), C(A;/C(H)))’'", the adjoint map 
L* induces a map K that sends p]V{H))^ into . 


Proof Let C C 3^ be any closed, z/-null subset. By outer-regularity, there are open sets 


containing C with arbitrarily small i/-measure. Therefore, by Urysohn’s lemma 105 


0 = v{A) = inf [ f dv 
fec{y),ix>f>ic Jy 

Hence, by the dehnition of i/, for any 0 G /C(J)+, 


(C.69) 


0= inf / ti pL((j)f)(x) dp(x) 

NC{y),iy>f>ic 


(C.70) 


For A any Borel subset of X and r any positive, self-adjoint operator on the span of hnitely 
many of the {oj}, any single-term simple function, tIa, can be scaled to be less than pl^’- 
Then 


0 = inf 

feC{y),ly>f>lc . 


l^(a;)tr TL{(j)f)[x) dp{x) 


(C.71) 


' xGX 


Now take {xj} and {Aj} to be hnite collections of such operators and subsets. For allowed 
the pointwise product /i ■ ■ ■ /n is also allowed and is less than or equal to each of 
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the fj] hence, 


It p 

inf ^ / 

NC{y),iy>f>\c AeA” 


lA, (a:)tr TjL{(j)f){x) d^{x) 


(C.72) 


= inf 

^ fec{y),ix>f>ic 
j=i 



lA, (a;)tr TjL{(j)f){x) dfi{x) = 0 


Since simple functions of the form D are dense in p; P(H))+, then for any 

element ^ G 


0 = inf 

feC{y),ly>f>lc 


/ ti Dx)L((pf)(x) d^(x) = inf 
L&x f&c(y),iy>f>ic. 


fd{cl>L*{^f^)) (C.73) 




where (j)L*{^fi) is a Radon measure. Therefore, it must be that {(f)L*{^^)){C) = 0. Since 0 
was arbitrary, L*{^fi){C) must be zero. 

By inner regularity, any Borel set can have its measure approximated arbitrarily well by 
closed sets it contains. Therefore, for any i/-null, Borel subset A, L*{^fi){A) is also zero. 
Therefore, is absolutely continuous with respect to z/. Since T’(H) has the Radon- 

Nikodym property [^, there is a -0 G T^(3^; u] 'D(J))+ such that L*jj, = 'ifju, which gives the 
desired map K hj = 'll). □ 


